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This thesis establishes a number of connections between foundational issues in quan- 
tum theory, and some quantum information applications. It starts with a review of 
quantum contextuality and non-locality, multipartite entanglement characterisation, 
and of a few quantum information protocols. 

Quantum non-locality and contextuality are shown to be essential for different im- 
plementations of quantum information protocols known as quantum random access 
codes and quantum communication complexity protocols. The simplest versions of 
these protocols are shown to be equivalent to tests of two- and multi-party contex- 
tuality and non-locality. From them I derive sufficient experimental conditions for 
tests of these quantum properties. 

I also discuss how the distribution of quantum information through quantum cloning 
processes can be useful in quantum computing. Regarding entanglement character- 
isation, some results are obtained relating two problems, that of additivity of the 
relative entropy of entanglement, and that of identifying different types of tripartite 
entanglement in the asymptotic regime of manipulations of many copies of a given 
state. 

The thesis ends with a description of an information processing task in which a 
single qubit substitutes for an arbitrarily large amount of classical communication. 
This result is interpreted in different ways: as a gap between quantum and classical 
computation space complexity; as a bound on the amount of classical communication 
necessary to simulate entanglement; and as a basic result on hidden- variable theories 
for quantum mechanics. In this case, the resource behind the quantum advantage 
is simply the real nature of the set of quantum pure states. I also show that the 
advantage of quantum over classical communication can be established in a feasible 
experiment. 
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Chapter 1 

Introduction 



The conceptual revolution brought to science by quantum theory is now almost a century old. 
Despite this grand old age, it still seems that the theory's full significance has not yet been 
appreciated outside a limited circle of physicists and philosophers of science. 

It is true that terms like 'quantum leap' and 'uncertainty principle' have been incorporated 
by everyday language. They are often used to convey concepts which are only remotely related 
to their physical meaning, but at least this shows that a wider public has taken note of some 
aspects of the interpretation of quantum mechanics. This timid acknowledgement, however, is 
completely overshadowed by the penetration of general relativity into mainstream culture, a 
theory of comparable age and importance. 

This acceptance of relativity's ideas by the general public shows up in popular science mag- 
azines and newspapers. Terms like black holes and space-time are presented as the theoretical 
constructs they are, and not just as misinformed analogies, as usually happens with quantum 
theory. I think this can be attributed at least to two factors. 

First, quantum theory's rupture with the previous theoretical paradigm was more radical 
than relativity's. A non-scientist may not have a good understanding of what space-time and 
black holes are, but at least he or she can be comforted in the knowledge that professional 
physicists apparently do. This is definitely not the case with quantum theory, whose interpreta- 
tion has always been a motive for argument even among experts. This seems to arise from the 
overturning of concepts preserved even by relativity, such as locality and perhaps determinism. 
This is why it seems quite appropriate to group the other great physical theories and name them 
'classical' (relativity included), while quantum theory remains in a class of its own. 

Another factor which keeps the discussion of the foundations of quantum theory away from 
a wider audience is the lack of practical applications. Relativity also has few widespread appli- 
cations, but this gap in public perception seems to have been filled by astronomical observations 
and thought experiments involving spaceships and black holes, for example. In comparison, 
quantum theory's applications seem much less convincing: the stability of matter and quantum 
tunnelling are two examples that come to my mind. Working applications of quantum theory's 
most baffling characteristics are not yet part of day-to-day life. 

In the last few years, the new field of quantum information has been changing that. Practical 
quantum information applications are just around the corner, with prototypes of quantum cryp- 
tographic setups and small-scale quantum computers already working in laboratories. These 
applications illustrate some of the counter-intuitive features of quantum theory in a particularly 
compelling way. A sign of this is the increasing use of these applications in introductory quantum 
mechanics courses. In a broader context, we see the public being engaged by the excitement of 
research in quantum physics, through television, general lectures and numerous popular science 
articles that have been appearing in the general press. 

By bringing the information-theoretic structure of quantum mechanics to the fore, these 
applications are very helpful to the researcher as well. Quantum information applications offer 
simple examples and procedures that illustrate important characteristics of quantum mechanics. 
As we will see in this thesis, these applications often suggest what is essential and what is 
accessory in quantum theory, highlighting features which may be of practical use and theoretical 
importance. 
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My research has been directed at examining these applications, and working out what they 
can tell us about foundational issues in quantum theory, such as contextuality and non-locality. 
It appears that taking into account the role of information in quantum theory will be unavoidable 
for major further developments. This is hinted at by the existence of information-based results 
(such as black hole entropy) which will need to be accounted for by some future unification of 
gravity and quantum mechanics. 

Of course, a deeper theoretical understanding of the information-theoretic aspects of the 
theory should also lead to new or improved applications. Another research goal of mine is to 
turn the insights coming from the foundations of quantum theory into useful quantum informa- 
tion applications that can hopefully be implemented in the near future. Given the theoretical 
promises and the current rate of experimental progress, quantum computing and other quantum 
technologies may assume a great economic importance in the next decades. 

Having stated two of my main research motivations, let me now give a preview of the work 
I have been doing along these lines in the last years, and which constitute this thesis. 

Chapters 2, 3 and 4 present introductory material respectively on the foundations of 
quantum theory, quantum entanglement, and a few quantum information applications. Most of 
the results here are drawn from the recent literature, but the choice of topics and the presentation 
is made so as to prepare terrain for chapters 5 to 9. The original material in the introductory 



chapters includes an analogy used to present Hardy's non-locality proof in section 2.3.2.1, and 
a curious fidelity balance result for cloning machines in section 4.3.4j . 

In chapter 5 I investigate the characterisation of tripartite entanglement, which is a much 
harder mathematical problem than the bipartite case. This problem is important as multi- 
particle entanglement shows up in natural systems, as well as in quantum information protocols 
such as the one I discuss in chapter ||. This chapter results from collaboration with Martin 
Plenio and Shashank Virmani from Imperial College (London). The work I report is mostly my 
own, while in the published version of these results [84] there are also analytical results obtained 
mostly thanks to my collaborators' considerable mathematical skills. Section results from 
numerical calculations performed in joint work with Leah Henderson from the University of 
Bristol. I also published a second paper on multipartite entanglement in joint work with Lucien 
Hardy l^lJl , but as its results did not fit in naturally with the rest of this thesis, I decided not 
to include them here. 

Encoding information in quantum systems involves some counter-intuitive constraints. One 
of these is the no-cloning theorem, which states that it is impossible to make perfect copies of 
an arbitrary unknown quantum state. Non-perfect but good-quality copying is feasible, but the 
question is: what are these quantum cloning machines good for? In chapter 6 I show that 
quantum cloning can be used to distribute quantum information in an efficient manner during 
quantum computations. This chapter resulted from collaboration with my supervisor Lucien 
Hardy, having been published in |82j . 

In chapter 7 I turn to a quantum information application known as quantum random 
access codes. We will see that some simple versions of these codes had already cropped up under 
different guises previously, without being noticed. I recognise the equivalence between these 
different protocols and show that their higher-than-classical performance derives from either 
quantum contextuality or non-locality. I also analyse the feasibility of implementing these codes 
experimentally, showing that a successful implementation would be equivalent to a contextuality 
or non-locality test. The quantum random access code using a qutrit in section 7.5.2 was found 
in joint work with Daniel Oi. 

In theory, quantum communication is better than classical communication for many tasks. 
However, given the difficulty of implementing quantum protocols experimentally, usually it is 
the classical protocols that outperform the quantum versions in practical setups. In chapter 8 I 
present solutions to a communication complexity problem which rely either on entanglement or 
on quantum communication. The analysis of this problem leads to sufficient conditions for test- 
ing multipartite non-locality and contextuality. Moreover, I show it is possible to demonstrate 
experimentally the advantage of quantum over classical communication in a feasible experiment. 
Some of these results appeared in preprint format [78], while others were published [js"o| . 

In chapter 9 I present a generalisation of the communication complexity problem discussed 
in chapter 8. I rigorously prove that a single qubit of quantum communication can be better 
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than any finite number of classical bits for a particular task. The result lends itself to different 
interpretations. First, I show it means that entanglement is very hard to simulate with classical 
communication. I also demonstrate that quantum computers can have an unbounded advantage 
in memory size with respect to classical computers. Interestingly, all these results can be shown 
to stem from a single axiom that distinguishes quantum theory from classical probability theory. 
This chapter represents joint work with my supervisor, and some of it has appeared in preprint 
format ^3|. Finally, in chapter 10 I offer some concluding remarks, and point at some open 
problems for further research. 

I hope the reader can recognise my two main research motivations in the treatment I give to 
the topics addressed in this thesis. In working on them I have acquired a more intuitive grasp 
of what quantum theory can offer for information processing, and why. Fortunately, the source 
of surprises seems to be endless. 



Chapter 2 



On the foundations of quantum 
theory 



Since the development of the mathematical formalism of quantum mechanics in the 1920's, the 
theory has established itself as the most thoroughly tested and successful physical theory ever. 
It provides solutions to many fundamental problems in physics, ranging from the stability of 
matter to the nuclear fusion reaction generating heat in our Sun. 

While its mathematical structure and predictive power can be precisely stated, quantum 
theory's conceptual foundations have been a motive for discussion for decades. This is un- 
derstandable, given the many fundamental differences between quantum theory and the great 
classical theories, such as thermodynamics and general relativity. In this chapter I will touch 
upon some of the known counter-intuitive features of quantum mechanics, reviewing some of 
the work done to clarify the conceptual foundations of the theory. Understanding these foun- 
dational issues is important in appreciating the beauty and subtlety of Nature itself, but also 
sheds light on why and how quantum information applications can be better than their classical 
counterparts. 

Maybe the first thing a student notices when introduced to quantum theory is that, unlike 
classical theories, quantum mechanical predictions are in general intrinsically probabilistic. Some 
of the founding fathers of quantum mechanics (notably Einstein) were very sceptical about 
this. They suspected that quantum theory was just an approximation to a more fundamental, 
deterministic theory, which they hoped would be developed in the future. Interestingly, after the 
work of de Broglie and Bohm, we know that a deterministic description of quantum phenomena is 
possible. This d eterministic (or causal) interpretation of quantum theory will be briefly sketched 
in section |2.1.1| . 

The discovery of a wholly deterministic interpretation of quantum theory was a herald of 
other important developments. Starting in the mid-1950's, some researchers have been able to 
understand better some of the unavoidable conceptual characteristics of any theory compatible 
with quantum mechanics. This remarkable achievement pointed out some surprising, highly 
counter-intuitive features of quantum theory. In this chapter we will briefly review some of what 
has been found so far by investigating the necessary limitations of more general hidden-variable 
theories compatible with quantum mechanics. 



In section 2.2 I will present some results showing that quantum mechanics is contextual. We 
will see this means that in general, the outcomes of a given observable A cannot be assigned 
pre-defined values, independently of which other commuting observables we choose to measure 
together with A. More than a mere curiosity of quantum theory, we will see in chapters [5] 
and U that the higher-than-classical performance of some quantum information protocols can be 
attributed to this quantum property. 

In section we will see how quantum non-locality arises naturally from contextuality. We 
will examine different types of proof of quantum non-locality, and present some material which 
will be relevant to the discussion of the uses of non-locality we will analyse in the remaining 
chapters of this thesis. 
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In chapters [/], ^ and |9] we will identify the fundamental quantum characteristics behind 
higher-than-classical performances in a number of information processing applications. We will 
see that the power of quantum systems with respect to classical systems can be traced back 
to fund ame ntal qua ntum traits that appear in axiomatic approaches to quantum theory. In 
sections 2A and 2J3 I will sketch some aspects of the work of Hardy, and Brukner and Zeilinger 
on the foundations of quantum theory. We will see that their approaches are directly relevant 
to understanding some quantum information processing protocols I will present later. 

In order to flesh out some of the most surprising of quantum theory's characteristics, we 
must start by presenting the hidden-variable theory research programme, and what it can tell 
us us about physical reality. 



2.1 Why hidden- variable theories? 

Many characteristics of quantum theory run against the physical intuition we developed in a 
mostly classical world. Quantum theory's description of Nature agrees more precisely with ex- 
periments than any previous physical theory, but at the same time its predictions are intrinsically 
probabilistic. 

A great deal of research has been done to try to find out whether this is a sign of quantum 
theory's incompleteness, or whether this non-determinism is a fundamental trait of Nature. 
Hidden-variable theories (HVT's) were built as attempts to find alternatives compatible with 
quantum mechanics, but with different conceptual foundations. In particular, many attempts 
have been made to build hidden- variable theories which could predict quantum phenomena while 
retaining the determinism and locality of classical physics. 

HVT's are important for many reasons. It is productive for theoreticians to view a problem 
through more than one alternative conceptual structure, provided by different HVT's. This 
helps one to develop an intuition about quantum phenomena. This is particularly helpful be- 
cause of the apparent inadequacy of classical intuition to account for many important quantum 
characteristics. Alternative HVT's put familiar problems in a new light, and may suggest new 
or improved applications in quantum information theory, for example. 

They also help us to identify the need (or not) for certain conceptual characteristics in any 
possible acceptable future extension of quantum mechanics. This is an indirect way of making 
explicit some possibly subtle traits of Nature, which may then be brought to the fore in future 
theories superseding quantum mechanics. 

In the next section we will briefly comment on the success of creating a deterministic HVT 
for quantum mechanics (Bohmian mechanics). This shows that an inherently probabilistic de- 
scription is not necessary. 

By exploring hidden- variable theories, we will see in subsequent sections that there are some 
characteristics which are unavoidable in any HVT compatible with quantum mechanics, for ex- 
ample contextuality and non-locality. In following chapters we will see how these characteristics 
can be explored in quantum information processing tasks. 

2.1.1 A deterministic hidden-variable theory is possible 

In his influential book Mathematical foundations of quantum mechanics p04| 1, John von Neu- 
mann claimed to have proven that quantum mechanics could not be replaced by any alternative 
deterministic HVT. For many years this seems to have blocked progress in this area. 

In 1966 Bell explicitly built a toy deterministic HVT describing any projective measure- 
ment on a single qubit fl3|| . Bell refuted von Neumann's claims in a simple but bold manner, 
showing that von Neumann's theorem only applied to a very restricted class of HVT's, which 
automatically ruled out the sort of successful HVT Bell came up with. Despite its conceptual 
importance, Bell's result was quite restricted: it did not account for Positive Operator Valued 
Measure (POVM)[] measurements, nor for the qubit 's dynamics. The question of whether it was 
possible to devise a deterministic HVT for general quantum mechanics (in arbitrary dimensions) 
remained open. 



For a description of the concept of POVM's see the excellent book by Peres [155. 



, section 9.5]. 
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Or rather, it did not: as Bell himself pointed out, this question had already been closed long 
before Bell's 1966 paper. Bell brought to attention the work of Bohm (l^, p0[ that proposed 
such a a theory in 1952, based on earlier work by de Broglie in the late 1920's [ p4 pi]] . For a long 
time Bohm's papers |pH| p0| were largely ignored by the physics community, for a conjunction 
of historical factors. 

In Bohm's causal (i.e. deterministic) interpretation of quantum theory, the whole experimen- 
tal set-up establishes a field which can be calculated from the wavefunction ip{t). Preparation of 
a quantum state, in this interpretation, consists of preparing an ensemble of states with different 
values of the hidden variable A. This hidden-variable is just the real position of the particle, and 
is distributed according to the initial probability distribution for finding the particle at position 
x, so it is proportional to \ip\ 2 - As time passes, the particles in this ensemble follow well-specified 
trajectories, 'guided' by the field which can be calculated from the time-evolved ip{t). 

Bohm showed that many measurements (spin measurements, for example) can be reduced 
to position measurements. In a Stern-Gerlach experiment, for example, each spin-^ particle 
follows a well-defined trajectory through the magnet, impinging on the end screen in one of two 
possible spots. The indeterminacy we see in the experiments arises from our lack of control over 
the preparation procedure: we always create an ensemble of states distributed according to the 
initial probability distribution. In the case of the particles in the Stern-Gerlach experiment, 
half of them follow trajectories leading to the upper spot, and half to the lower one. But each 
particle follows a well-defined trajectory, the indeterminacy arising from our lack of knowledge 
and control over the ensemble we prepare. This approach puts the quantum indeterminacy on 
the same footing as the indeterminacy in classical statistical mechanics ensembles. 

Bohmian mechanics has been further developed in recent years. For example, Bohmian 
trajectories have been calculated for the double-slit experiment [155]; the hydrogen atom fl07|] ; 
and to barrier-tunneling [ j66|| . 

By adopting Bohm's interpretation of quantum mechanics, we recover a deterministic theory. 
There is, however, a conceptual price to pay. We need to introduce hidden-variables which we 
are unable to manipulate, but which completely determine all the particles properties, through 
their trajectories. If these hidden-variables have a physical existence, one is allowed to think 
they will eventually become accessible in the future, after some technological advances. This 
may be an impossible dream, however; there has been research suggesting that manipulation 
of Bohm's hidden v ariables would result in problematic possibilities such as faster-than-light 
signalling p6jp5| . 

Independently of the interpretation we give for quantum theory, it retains counter-intuitive 
conceptual characteristics, such as contextuality and non-locality. In the next sections we will 
briefly explore these two fundamental characteristics of quantum mechanics. We will see that 
they can be harnessed to obtain computation and communication gains in quantum information 
applications. 



2.2 Quantum contextuality 

Let us consider a quantum system in a pure state Mathematically, \ip) can be fully charac- 
terised by a complete set S of mutually commuting operators (called observables) of which \tp) 
is a simultaneous eigenvector, together with the eigenvalues that will be found if each of those 
is measured. The usual physical interpretation is that each observable corresponds to a physical 
property being measured, such as angular momentum, position, and so on. Being a simultaneous 
eigenstate of the set S of observables, whenever \tp) is measured for these properties the results 
will be predictable and the same, corresponding to one particular eigenvalue of each observable 
O; < S. 

By analogy with classical physics, when we physically implement the measurement of observ- 
able Oi with a suitable physical apparatus, we say we are measuring the system's property which 
is defined by Oi. For example, we say that we are measuring the electron's intrinsic angular 
momentum along the z direction when we send through an electron through a Stern-Gerlach 
magnet oriented along z. 

A set S of observables completely characterises a pure state when the outcomes of all the 
observables in S are predictable. We tend to say that all the properties of the system are known, 
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and also tend to think that these properties exist in principle independently from each other. 
This intuition is corroborated by performing different experiments on multiple copies of the same 
state. Each experiment can measure a different subset of the observables in S; we still obtain 
the same, predictable outcomes, independently on the experimental context (i.e. independently 
on the particular sub-set of observables Oi £ S chosen to be measured in each experiment). 

But now suppose that we are given an unknown pure state, prepared by another party. Now 
we have no guarantee that it will be an eigenstate of any particular set of observables that we 
decide to measure. Still, our classical intuition leads us to expect that even so, the result we 
will obtain when measuring observable Ok is independent on the experimental context, i.e. on 
which other commuting observables I decide to simultaneously measure together with observable 
Ok- This intuition fits with our attribution of pre-defined properties to the system, which are 
independent of each other and which can be measured selectively at will. 

This non-contextuality hypothesis can be shown to be false for quantum mechanics. Let us 
now review some proofs of quantum contextuality, and briefly discuss the significance of this 
result. 



2.2.1 Two simple contextuality proofs 

When we consider the typical situation relevant to Heisenberg's principle, one produces a state 
and tries to measure observables which do not commute. Then we find that measurement 
results seem to indicate that there is no predefined value for the outcomes of non-commuting 
observables, prior to the measurement. 

This non-existence of pre-defined properties, however, may be just an enticing suggestion of 
the usual Copenhagen interpretation of quantum theory. Instead of just accepting this suggestion 
unquestionably, it turns out that we can actually prove that these values cannot be pre-defined 
non-contextually. In this section we will review two simple contextuality proofs, due to Peres 
[ |154| ] and Mermin |144| ] . We start with Peres' proof. 



2.2.1.1 Peres' contextuality proof 

Let us consider a particle in the singlet state: 

|0-> = -^(|O j1 1b>-|UOb», (2-1) 

where |0) • and |1) ■ represent two orthogonal states of party j's subsystem. This is an example of 
a canonical maximally entangled pair of qubits, also called an Einstein- Podolsky- Rosen (EPR) 
pair, or alternatively a Bell state. 

This state is a simultaneous eigenvector of observables which can be conveniently written 
using the three Pauli matrices: 

° 1). «-( l o -il (-> 





' 






o- x = I 






» a v = ( 



State (2.1) is an eigenvector (with eigenvalue -1) of the observables o x \ ® a X 2, o~ y i <8> o~ y 2, 
and a z \ tg) o~ Z 2, where the sub-indices indicate which particle (1 or 2) we are referring to. The 
non-contextuality assumption tells us that the outcomes for each one-particle observable are 
pre-defined, and do not depend on which other commuting operators we decide to measure. For 
example, the outcome x± = ±1 that we obtain by measuring a x \ must not depend on whether 
we decide to measure a X 2, cr y 2 or a Z 2, since these three operators commute with a x \. By giving 
these hypothetical measurement outcomes definite values x\, X2, yi, U2, z\, Z2-, we are accepting 
the classical idea that all a quantum measurement does is to reveal a pre-existing property of a 
system, independent of other properties we may wish to measure. 

Now, the fact that a singlet is an eigenvector with eigenvalue -1 of the three operators above 
means there is a constraint on these hypothetical pre-existing values: 



xix 2 = y\V2 = ziz 2 = -1. 



(2.3) 
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Let us consider the outcomes on some other possible measurements on this state. Consider 
the observables A = u x \ (g a y 2 and B = a y \ (g a X 2- Note that [A, B] = 0, which means that 
we can measure A and B simultaneously. This can be done in two ways. One possibility is to 
measure the observable AB directly. To see what the measurement outcome will be, note that 



AB = a xl <g OylVyX ® &x2 = &zi <g cr z2 , (2.4) 

which, as we have seen, always results in -1 for the singlet state. 

The second way to measure the product AB is to measure A and B separately, and multiply 
the results. Quantum mechanics predicts that a simultaneous measurement of A and B will 
yield results consistent with the prediction above. The non-contextual assumption, on the other 
hand, tells us that the result of measuring A = a x \ (g a y 2 should be xiy2> and of me asu ring 
B = o y \ (g a X 2 should be yiX2, where x±, X2, yi and y2 are the same that appear in eq. ( |2.3| ), in 
a different context. Multiplying these two predicted measurement outcomes we obtain 

xiy2ViX2 = (x 1 x 2 )(yiy2) = (-1)(-1) = +1, (2.5) 
[because of constraint ( [2, 3D ], which is the exact opposite of the outcome w e ex perimentally 



measure (-1), as we have seen above. This contradiction, obtained by Peres in | 154 |, shows that 
we cannot assign definite results to outcomes of non-commuting observables in a non-contextual 
way. 



2.2.1.2 Mermin's contextuality proof 



The above proof relies on some prop erties of the particular state given by eq. ( |2.1| ). Building 
on Peres' simple proof, Mermin [144] was able to obtain a contextuality proof which is state- 
independent. It goes as follows. 

Consider a two-qubit system in any quantum state. Now consider the following array of 
operators: 



11 ®a z 


a z (g>l 


a z <g) a z 


a x (g 1 


1 ®a x 


a x (g a x 


a x (g a z 


<?z <g> cr x 


<Jy (g Oy 



Note that each of the nine operators have eigenvalues ±1, and that the three operators in 
each row or column commute. It is also easy to check that each operator is the product of the 
other two operators on the same row or column, with a single exception: the third operator in 
the third column equals minus the product of the other two. 

Suppose that there exist pre-defined values (1 or -1) for the outcomes of these nine opera- 
tors. Then we can substitute each of the nine operators above by the pre-defined value of its 
outcome. This leads us to an impossible situation: because of the minus sign that appears in 
the third column, there is no consistent way of assigning pre-defined values to the nine operators 
listed above. These pre-defined measurement outcomes simply cannot exist, if we do not make 
explicit reference to the complete set of commuting observables being measured in the particular 
experiment under consideration. 



2.2.2 Other results on contextuality 

The two proofs we have presented above make use of a 4-dimensional Hilbert space. Proofs 
using only 3-dimensional systems are also possible, and in fact appeared much earlier than the 
two proofs above. 

Building on earlier work by Gleason |9(J, Kochen and Specker |122| provided the first proof 
of quantum contextuality. They used a very complicated geometrical construct, involving 117 
operators on a 3-dimensional space. They were able to show that this set of projectors could 
not be consistently assigned values and 1 in a non-contextual way. I chose to present Peres' 
and Mermin's proofs because they are conceptually and algebraically much simpler than the 
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results of Kochen and Specker. For a clear disc ussio n of Kochen and Specker's theorem and its 
philosophical relevance, see the article by Held |104| 1, 

As we will soon see, quantum non-locality involves a special type of quantum contextuality. 
Because of the great number of applications of quantum entanglement that have been found 
recently, the discussion of non-locality has eclipsed that of quantum contextuality. Nevertheless, 
much of the discussion on non-locality can be relatively easily adapted to non-contextuality, 
particular many of the simple non-locality proofs are also contextuality proofs [p^, 91, 144 



in 



More recently, some effort has been put into devising very simple contextuality tests, with a 
view to experimental realization [45, |146| , |177f| . Generally, it is easier experimentally to perform 
a contextuality test than a non-locality one. We will further explore this issue in chapters |7| and 



2.2.3 Making sense of contextuality 

The proofs above tell us something very unusual about Nature. In classical physics the mea- 
surement of one property of a system does not interfere with measurements of other properties. 
We can thus think of different properties of a classical system as existing independently of our 
actual experimental setup. 

In quantum mechanics the situation is different. We have just seen that it is impossible to con- 
sistently assign pre-existing values to measurement outcomes of observables in a non-contextual 
way. This quantum property is called contextuality because the outcome of an observable A in 
general must depend on the experimental 'context', i.e. on which other commuting observables 
are measured together with A. In the array appearing in Mermin's proof, for example, each 
operator appears in two different contexts, representing two different possibilities of commuting 
operators to be measured simultaneously with it. The fact that we used a single symbol to rep- 
resent the outcome of an observable in these two different 'contexts' leads to the inconsistency 
pointed out in the proof. 

Quantum contextuality forbids us, in general, to assign pre-defined elements of reality to 
individual observables. This is in marked contrast to the classical case, where the momentum 
of an object has an objective existence, independently of its angular momentum, for example. 
Contextuality tells us that if we still want to stick to the idea that quantum systems have well- 
defined properties, the only consistent way of assigning them would be with respect to the full 
experimental set-up used to measure them. As remarked earlier, this is not so unreasonable, 
given that an apparatus to measure just the momentum of a particle is different from an ap- 
paratus that measures both momentum and angular momentum, for example. Of course, if we 
measure a complete set of commuting observables there is no contextuality possible, because by 
defin ition a complete set of observables provides the full experimental context. In Peres' book 
[|155| , chapter 7] , he suggests that we accept the ambiguity involved in defining the value of the 
measurement outcome of a single operator (or incomplete set of them), and simply refrain from 
saying a single observable reveals a well-defined property of a system. Only complete sets of 
commuting observables could be said to reveal objective properties of a system. 

In quantum information processing, it is sometimes convenient not to measure a complete 
set of operators, measuring only a few. This can be used to prepare a state, which is then sent 
to a second party. The measurements that the second party can perform will have contextual 
outcomes, i.e. dependent on the measurement outcomes previously obtained by the first party. 
This way, we can explore quantum contextuality as a resource for quantum information tasks. 
We will develop this idea in chapter 

Understanding that quantum mechanics is contextual was an important step towards re- 
alizing it is non-local as well. In the next section we discuss quantum non-locality, which is 
an essential ingredient to a number of quantum information applications we will investigate in 
future chapters. 
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2.3 Quantum non- locality 

In all of classical physics, the description of a system's state depends only on itself and its 
immediate surroundings; in other words, classical physics is local. In 1935 Einstein, Podolsky and 
Rosen (EPR) published an influential paper JF6| in which they showed that spatially extended 
quantum systems were apparently non-local. Fearing that this non-locality would conflict with 
causality, they concluded that quantum mechanics was necessarily incomplete, and would need 
to be replaced in the future with some hidden-variable theory which still preserved causality 
and locality. 

At some point it became clear that the apparent non-locality of quantum mechanics did not 
allow for faster-than-light communication. The next major step was taken by Bell, who in 1964 
showed that any hidden-variable theory compatible with quantum mechanics must necessarily 
be non-local fll2| . This provided a definitive negative answer to EPR's suggestion that quantum 
mechanics might still be compatible with a local theory yet to be developed. 

Conceptually, quantum non-locality follows naturally once we accept contextuality. In sec- 



tion 2.2 we have seen that the measurement outcome of an observable A depends not only on 
A and the state of the system, but also on the particular choice of commuting observable B to 
be measured together with A. This, in a nutshell, is the essence of what we call quantum con- 
textuality. It turns out that this contextuality is still present for commuting A and B that are 
each measured on spatially separated particles. The crucial difference now is that the outcome 
of observable A will necessarily depend on a distant (possibly even space-like separated) set-up 
to measure observable B. Quantum mechanics exhibits non- locality when it remains contextual 
even in this spatially separated setting. 

There have been many experimental tests of quantum non-locality, and although some possi- 
ble loopholes remain to be closed, the evidence has been overwhelmingly in support of quantum 
mechanical predictions. Besides its theoretical interest, many information processing and com- 
munication applications have been found for non-local effects, as we will see in the chapters 
ahead. In the rest of this section we will review some quantum non-locality proofs, which will 
be useful for the discussion in the rest of this thesis. 



2.3.1 The Clauser-Horne-Shimony-Holt inequality 



The first proof of quantum non-locality was provided by Bell in 1964 [12]. His proof involved 
devising an inequality that simple measurement outcomes on two distant systems needed to sat- 
isfy, if the outcomes at each location could not depend on the setting at the second location (i.e. 
if locality holds). In this section we will present a related inequality derived by Clauser, Home, 
Shimony and Holt (CHSH) in 1969 @. In chapter g we will see that quantum non-locality 
associated with violation of the CHSH inequality can be used in some quantum communication 
protocols. 

The inequality. Consider a situation in which we perform two-outcome (±1) measure- 
ments on two spatially separated systems A and B. It is enough to consider just two different 
measurements at each location. If the outcomes at A are independent on the settings at the 
distant B, then CHSH showed the following inequality must be satisfied |]51|: 



|<aibi) + (a 2 &i> + (a 2 6 2 ) - (ai& 2 )| < 2, (2.6) 

where a, and bj denote the outcomes of measurement i on system A and j on system B, and the 
() indicate an average over many runs of the experiment. A simple, if tedious, way to check that 
the inequality is true is to substitute all sixteen possible combinations of values for a\, a 2 , b%, 6 2 
in the left-hand side of the CHSH inequality ( |2.6| ) . 

Note that the inequality makes no reference to quantum mechanics, it just puts a bound on 
a particular combination of averages over dichotomic variables. By deriving an inequality which 
must be obeyed by averages of experimental runs, both Bell and CHSH made it possible a direct 
comparison of any local theory with quantum mechanics, which only predicts probabilities and 
averages (expected values of observables) . We will soon see that quantum mechanical predictions 



conflict with the CHSH locality inequality (2.6). 
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A different way of expressing inequality (|2.6|) results if we use p e (cii, bj) [p rf (a«, bj)] to denote 
the probability of obtaining equal [different] outcomes for measurements cubj. Then it is easy 
to obtain the following inequality from (|2.6|): 

p d {a 1 ,b 1 )+ P d {a2,b 1 )+p d (a2,b2)+p e (a 1 ,b 2 ) < 3. (2.7) 

These two inequalities are equivalent, and must be obeyed by two-outcome experiments predicted 
by any local theory. 

Quantum violation. Now let us see how quantum mechanics violates the inequalities 
above. For that, we will start with a maximally entangled state of two qubits, one in each 
spatial region A and B: 

\<f>-) = -L(\0 A l B )-\l A B )). (2.8) 

At each location, one out of two possible projective measurements will be chosen at ran- 
dom. Being projections, there are only two possible outcomes for each measurement, which 
we will denote by ±1. Let us represent the projector operators on each qubit by P(9,<j>) = 
|V>(0,0))W0, 0)1, where 

\ip(0, (/))) = cos(0/2) |0> + exp(z<£) sin(0/2) |1) . (2.9) 

This parametrization of pure states of a qubit assigns each point on a sphere (the 'Bloch sphere') 
to a unique pure quantum state. This qubit parametrization is very useful, and will be used 
many times in this thesis. 

Maximal violation of the CHSH inequality ( |2.6D happens when we consider the outcomes 
of projectors on qubits A and B that alternate on the same plane (crossing the sphere at its 
centre), at equally spaced angles separated by 7r/4 radians. For example, we can choose to 
measure the following set of operators: P ai = P(0,0), P a2 = P(tt/2, 0), P bl = P(vr/4,0), and 
Pb 2 = -P(37r/4, 0). For these observables we can easily calculate the following expectation values: 

<tf | P ai P bl \^>} = | P a2 P bl m = (* | Pa 2 Pb2 l*> = - (*l KK l*> = COs(^/4) (2.10) 

If we now assume that each two-qubit operator above is just revealing the pre-existing values 
ai = P ai \^>) and similarly for a,2, 6i, and 62, we can substitute those in the CHSH inequality 
(2.6), obtaining: 

(*| P ai P bl I*) + (*| P a2 P bl I*) + <tf| P a2 P b2 I*) - (*| P ai P b2 I*) I (2.11) 

= 4cos(vr/4) = 2V2 > 2, (2.12) 

which shows that the CHSH inequality is violated by these quantum measurements. 

Instead of using the expectation values in inequality ( |2.6D , we can equivalently substitute 
the outcome probabilities in inequality (2.7): 

Pq(a 1: bi) =p d q (a 2 ,b 1 ) =p d (a 2 ,b 2 ) =p e q (a u b 2 ) = cos 2 (^/8) (2.13) 

=> p d q { ai M) + P d q (a 2 , h) + p d q (a 2 , 6 2 ) + p|(oi, 62) = 4 cos 2 (7r/8) > 3. (2.14) 

The violation of these two equivalent inequalities shows that measurements on a maximally 
entangled state violate locality. 
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2.3.2 Non-locality without inequalities 

In the CHSH non-locality proof above we used either expectation values of quantum experi- 
ments, or equivalently the probability of a given outcome. To ascertain these quantities, many 
realizations of the experiment are necessary until we reach a statistical violation of the relevant 



inequality. In this section I review Hardy's simple proof of quantum non-locality [97, 98, 103], 
which makes do without statistical considerations. Instead, Hardy showed that we reach a logical 
contradiction if we assume: 

a- that quantum mechanics is correct; and 

b- that locality holds. 

This type of reasoning offers a different way of uncovering non-locality in quantum mea- 
surements, one which is perhaps more satisfying intellectually. This is because most people 
recognise a flagrant contradiction when they see one, but have more trouble doing the same 
when the contradiction only appears at a statistical level. Before Hardy, a similar inequality- 



free contradiction between quantum mechanics and locality had been found for four [92] and 



three particles |145| ] . Hardy simplified the reasoning and showed that a similar proof was also 
possible for tw o pa rticles only. Curiously, his proof does not use maximally entangled states, as 
references [92, 145] do. 



Before presenting the actual proof, I will dramatise the situation by discussing a hypothetical 
real-life scenario (a duel) which is paradoxical if we rule out non-locality. This is an extended, 
adapted version of a similar discussion by Hardy himself that appeared in |103| ]. When we 
come to the non-locality proof, we will see that its structure mirrors that of the duel, and that 
quantum non-locality offers a solution to the apparent paradox. 



2.3.2.1 A strange duel 

Suppose that we are running a business which provides duellists with rooms in which they can 
settle their scores. Our duelling stand consists of a very long room, at whose ends the two 
duellists will aim a pistol and shoot. After allowing them in, we close the soundproof room and 
let the duel take place. After each duel, Alice enters the room at end A and Bob at end B to 
attend to the duellists. 

Observation outcomes. After each duel, it is always the case that both gunmen are lying 
down, with their eyes closed. Since sometimes these same gunmen will be found to be alive, we 
assume they lie down even when not shot just to relax after the stressful event. The observers 
Alice and Bob can choose to perform one of two tests, but not both at the same time. These 
can be: find out if the man is alive (by listening to his heartbeat), or find out if his gun has fired 
(by measuring the temperature of the gun) . 

Alice and Bob attend to many duels, and by comparing notes they find out the following 
three puzzling facts: 

Observation 1- if both Alice and Bob checked their client's heartbeat, they discover that in 
all duels at least one gunman was killed; 

Observation 2- whenever Alice found A's gun unused and Bob checked B's heartbeat, he 
always found him alive, and vice-versa; and 

Observation 3- when both decided to examine the guns, sometimes it happened that neither 
gun was used. 

How can there be duels in which neither gun was used, but in which at least one gunman 
always dies? Perhaps one of the duellists killed using a concealed weapon, which could explain 
observations 1 and 3. This explanation, however, is inconsistent with observation 2. 

A bit of thought shows that there is no way to have duels consistent with observations 1,2 
and 3. One way to see this is to make a table of all possible combinations for experiment 
outcomes (there are sixteen of them). From these sixteen, there are only five which comply with 
observations 1 and 2, and none of these satisfy observation 3, i.e. involve two cold guns. 
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Hidden variables. But wait, there is perhaps one way to explain these strange duel out- 
comes. The normal duelling rules involve many assumptions: that they will try to shoot each 
other, the only weapons they have are the pistols, and so on. But perhaps the appearances here 
deceive us, and the duellists are actually maniacs who are just trying to puzzle Alice and Bob 
at any cost. If that is the case, then perhaps they have agreed on a sophisticated, morbid pact 
just to fool the two innocent observers into thinking that an apparently impossible task is being 
performed. 

One way of doing this would be for them both to lie down, while secretly glancing at the 
other end of the stand through half-closed eyes. When Alice and Bob come in, both gunmen 
have a chance to see what sort of equipment each is bringing into the stand. The equipment 
necessary for each measurement is different, being a stethoscope to check the heartbeat or a 
thermometer to find the gun's temperature. When both A and B realize what is going to be 
measured, they can set things up so that observations 1,2 and 3 are true, no matter what the 
cost. One simple, if contrived, way of doing this is for A and B never to shoot; and if both Alice 
and Bob come in with stethoscopes, then either A or B must quickly kill him/herself with a 
hidden cyanide tooth-cap. 

Non-locality. Note, however, that some communication was required between the two duel- 
lists in order to conform to the three observations above. In the case we have just presented, this 
communication was conveyed by visual means. As we have noted above, however, any successful 
cheating behaviour by the duellists must involve information about what is happening at the 
other end of the stand. But we can assume that the duelling stand is very long, and that Alice 
and Bob come in simultaneously to check on the duellists (i.e., the two ends of the stand have a 
space- like separation). In such conditions the two separated gunmen cannot communicate, and 
therefore there seems to be no way for them to conform to the three experimental observations 
of Alice and Bob. 

For space-like separated measurements, there is only one way for the gunmen to fool the 
observers. The gunmen must have access to some device which reacts instantaneously to what 
is being measured at the other end of the stand. Short of a faster-than-light signalling device, 
there is only one known way of doing this. This involves exploring quantum non-locality, as we 
will see in the next section. 

Suppose that, after many duels on a stand with space-like separated ends, Alice and Bob 
gather experimental data consistent with observations 1,2 and 3. Then, strange as it may 
seem, we will be forced to conclude that whatever the mechanism behind the strange gunmen 
behaviour, it must react instantaneously to distant events - it must be non-local. 

In the next section I will review Hardy's non-locality proof which uses non-local measure- 
ments mirroring exactly the duelling scenario I have just presented. We will see that a simple 
two-qubit entangled state can provide the non-locality necessary for space-like separated gunmen 
to successfully account for the three observations above. 

2.3.2.2 Hardy's quantum non-locality proof 

Instead of finding out information about two duelling gunmen, let us here consider measurements 
that Alice and Bob can make on a pair of entangled qubits. In the duelling scenario the observers 
have two mutually exclusive measurements options, one which distinguishes between alive and 
dead (men), and one between hot or cold (guns). In the quantum situation this corresponds to 
Alice and Bob each choosing to measure one out of two dichotomic observables. To keep the 
quantum discussion running parallel to the duelling paradox, let us denote these observables H 
(for health) and T (temperature). 

Observable H has two eigenvectors, corresponding to the two possible eigenvalues obtained 
in any measurement. Let us call these eigenvectors \a) and \d), for alive and dead, respectively. 
Similarly, observable T has the two eigenvectors \h) and |c) corresponding to hot and cold. The 
fact that Alice and Bob cannot measure these two simultaneously shows up in the quantum 
scenario from the non-commutativity of observables H and T at each location (note, however, 
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that [Ha,Tb] = [Ta,Hb] = because they refer to different particles). So we can express the 
\h) and |c) eigenstates as linear superpositions of \a) and \d): 



\h) l = a l \a) l + p i \d) l (2.15) 
c>< = Id),. (2.16) 



These two equations can be inverted: 



--a*\h) l + f3 l \c) l (2.17) 
ftWi-ailc)^ (2.18) 



where the subindex i indicates which particle we are referring to, either A or B. 

Using the two non-orthogonal states \h) and \a), we can define a particular entangled state 
of two qubits, which we will use to demonstrate non- locality: 

|V) = N (\h) A \h) B - a A a B \a) A \a) B ) , (2.19) 

where N(oia, «b, Pa, Pb) is a state-dependent normalisation constant. 

Observation outcomes. Now let us verify that the outcomes of observables H and T for 
state \tp) indeed conform to Alice and Bob's paradoxical-looking observations. Let us start by 
observation 1. 

The first observation stat es th at we never find both particles in the 'alive' state; this corre- 
sponds to saying that in eq. 2.19| the amplitude multiplying \a) A \a) B mus t be zero. In s pite o f 



the appearances this is actually true, as we can check by substituting eq. ( 2.15Q into eq. ( [2.1 9| ) 



|V) = N{(a A \a) A + Pa \d) A ) («b \o)b + Pb \d) B ) - a A a B \a) A \a) B } . (2.20) 

We see that the \a) A \a) B terms cancel out in the above expression, which means \tp) always 
satisfies the first observation. 

Let us now check whether a cold gun at A always means we find a live person at B (observation 
two). In our quantum scenario this translates as: is the amplit ude m ultiplying the t erm \ c) A \ d) B 
zero? The answer is yes, as we can check by substituting eq. ( [2.17 ) into eq. fl2.19|) for \a) A : 



|V) = N (\h) A \h) B - a A a B (a* A \h) A + Pa \c) a ) \a) B ) . (2.21) 

It is easy to perform a similar substitution and check that a cold gun at B also implies that A 
is alive. These two perfect correlations are what we need to satisfy observation 2 of Alice and 
Bob. 

Now all that is left to do is to check that observation 3 is true, and we sometimes find both 
guns cold. For that we need to have a non-zero amplitude for the \c) A \ c) B term. If we again 
substitute ( pT7| ) into ([TL9|) we find: 



|V) = N{\h) A \h) B - a A a B (a* A \h) A + Pa \c) a ) (a* B \h) B + p B \c) B )} . (2.22) 

Note that the amplitude multiplying \c) A \c) B is No.aoi B PaPb- This means that both guns 
will be found to be cold with probability | Noiaoi B PaPb\ ■ This probability will be non-zero 
whenever the two observables for each party fail to commute. If we denote the golden mean by 
t = g (l + \/5), the maximum value for this probability can be shown to be equal to 1/r 5 ~ 9% 
J98| , |97j, a value which is obtained for a less than maximally entangled state. 

Quantum non-locality. As we have noted in the previous section, both the duelling 
scenario and Hardy's non-locality proof contradict the locality assumption. In the duelling 
scenario, if the two observations are time-like separated, it was possible for the two gunmen to 
communicate and set things up in line with the three observations of Alice and Bob. In the 
quantum case this is also true, and these strange protocols go by the name of local hidden- 
variable theories. 

For space-like separated measurements, however, there can be no communication between 
the two space-time regions A and B. This restricts any local hidden- variable theory to define 
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the measurement outcomes without having access to information about the distant measuring 
apparatus. The only way of making sense of these quantum measurement results is to accept 
that quantum measurement outcomes in one place can be influenced instantaneously by distant 
experimental setups, i.e. quantum theory is non-local. 

Using quantum non-locality. Now maybe we should go back to the duelling scenario 
to convince those who are still sceptical about quantum non-locality. There, because we make 
reference to more day-to-day properties than projections on qubits, the apparent paradox seems 
to be more vivid and strange. A sceptical may be unwilling to accept that people and guns can 
exist in superposition states and behave non-locally in the same way as particles, and thus may 
think that the seemingly impossible observations of Alice and Bob cannot possibly be obtained 
for space-like separated measurements. 

It is important to stress, however, that the duellists can trick Alice and Bob by using quantum 
non-locality. For that, before each duel they need to share instructions on how to proceed (local 
hidden variables) plus two qubits in the entangled state maybe kept in futuristic matchbox- 
sized ion traps in their pockets. When the observer comes in, each gunman quickly presses a 
button, which implements the measurement of observable T or H, according to what equipment 
the observer is bringing in the stand. The ion-trap experiment outcome is dependent on what 
is being (was, will be) measured at the other end of the stand, and thus can be used by the 
gunmen to mimic the quantum mechanical correlations and trick the two observers. 

This last point illustrates a general idea that brings quantum non-locality closer to everyday 
life. The idea is to make use of non-locality to make some seemingly impossible communication 
tasks possible. This turns non-locality from a curious theoretical trait of quantum theory, into 
a useful resource that can be explored in quantum computing and communication. What is 
more, it becomes harder to doubt the existence of non-local phenomena once useful applications 
of non- locality can be successfully tested in the lab. In chapters and ||] I will explore some 
applications of quantum non-locality. 



2.4 Hardy's axioms for quantum theory 

One way to try to understand quantum mechanics is to find simple axioms from which one can 
derive the whole theory. An axiomatisation of a theory is a way to compress its information 
content to a small core of axioms, whose status and meaning can then hopefully be used to shed 
light on the essential characteristics, rather than the complex structure arising from them. 

Recently, Hardy proposed a set of five simple axioms from which he deduced the math- 
ematical structure of quantum theory for discrete systems flOC| , p.01[| . Since his axioms will 
prove important in our discussion of quantum communication in chapter 0, I am now going to 



reproduce them, adapting slightly from [101]. 



Before we proceed, let us recall what pure states are. We say a physical system is in a pure 
state when its measurement outcomes cannot be reproduced by any statistical mixture of other 
states. In this sense, pure states are extremal in the set of all allowed states. Operationally 
speaking, pure states embody the idea of states for which some property is known for certain, 
without statistical uncertainty. This is because for every pure state there exists an experiment 
which always results in a predictable outcome. 
Now we need a few definitions: 



Dimension : The dimension of a classical or quantum system (denoted by ./V below) is the 
maximum number of distinguishable states in which we can prepare the system. 

Degrees of freedom : The number of degrees of freedom of a system (denoted by K below) 
is the number of probability measurements necessary to completely describe a state. 

More generally, the number of degrees of freedom is the number of independent real parame- 
ters needed to completely characterise the state of a system. With these two simple definitions, 
we can state the axioms: 
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Axiom 1 Probabilities. Relative frequencies (measured by taking the proportion of times a 
particular outcome is observed) tend to the same value (which we call the probability) for 
any case where a given measurement is performed on a ensemble of n systems prepared 
by some given preparation in the limit n — > oo. 

Axiom 2 Subspaces. There exist systems for which N = 1, 2, • • • , and, furthermore, all systems 
of dimension N, or systems of higher dimension but where the state is constrained to an 
N dimensional subspace, have the same properties. 

Axiom 3 Composite systems. A composite system consisting of subsystems A and B satisfies 
N = N A N B and K = K A K B . 

Axiom 4 Continuity. There exists a continuous reversible transformation on a system between 
any two pure states of that system for systems of any dimension N. 

Axiom 5 Simplicity. For each given N, K takes the minimum value consistent with the other 
axioms. 



Rigorously speaking, the fifth axiom is not an axiom, as it makes reference to the other 
axioms. Nevertheless, the five propositions stated above are simple and can be motivated by 
our experience with classical probability and physical reality, with the exception of Axiom 4. In 
finite-dimensional classical systems, we can only jump discontinuously between one pure state 
and another, for example different digital values of a computer memory, or different sides of a 
die. However, if the word "continuous" is dropped from Axiom 4, it is possible to show (see 



100| , 101]) that the five axioms give rise to classical probability theory. 



By rejecting these discontinuous jumps between pure states of a finite-dimensional system, 
and re-introducing the word "continuous" in Axiom 4, Hardy showed that we can derive the 
mathematical structure of quantum theory instead. This includes the linearity of quantum 
mechanics, the trace formula for expected values of observables, and the general form of allowed 
super-operators. 

In this approach, the crucial difference between quantum and classical physics is the contin- 
uous nature of the set of quantum pure states, as opposed to the discrete nature of the set of 
classical pure states. As we will see in chapter H it is exactly this which allows for an unbounded 
separation in communication power between a qubit and any finite-dimensional classical system. 



2.5 The invariant information of Brukner and Zeilinger 

Another approach to the foundations of quantum mechanics was taken by Brukner and Zeilinger 



in a series of papers [£6], |218| , |2q , |28| , 27fl . It is based on taking as a foundational principle the 
following statement: 'an elementary system represents the truth value of one proposition'. This 
amounts to using what is basically Holevo's theorem as a foundational principle for quantum 
mechanics. 

Brukner and Zeilinger have been investigating how much of quantum theory can be derived, 
or better understood, using this simple principle and little else. As an example of their approach, 



let us have a look at how they analyse the simple case of two-qubit states [218, 29]. 

According to the principle, two qubits can encode a maximum of only two bits. However, 
there is a continuous set of possible encoding strategies. In one extreme, the two bits could be 
encoded only in the correlations of outcomes for Alice and Bob. One way to do this would be to 
prepare the two qubits in one of the four Bell states, which can be discriminated through local 
operations and classical communication between the two parties. The two bits of information can 
only be gathered when the parties communicate to compare their measurements, obtaining the 
tell-tale correlations. Because all the information is contained in the correlations, however, there 
must not be any way for them to gather more information through their local measurements. 
This would then be the reason why the local measurement outcomes are completely random 
(corresponding to a completely mixed reduced density matrix p = 1/2). 

To continue with this simple example of two qubits, we could instead prepare a two-qubit 
state so that the two encoded bits are available locally. This would entail preparing definite 



CHAPTER 2. ON THE FOUNDATIONS OF QUANTUM THEORY 



17 



states for each qubit separately. According to the principle, if the two bits are available locally, 
there must not be any further information to be gathered in the correlations. No entanglement 
is allowed, and we are dealing now with two qubits in a product state. 

Instead of dwelling further on the consequences of their principle, let us now turn to an es- 
sential ingredient of their approach, which is a mathematical expression quantifying the amount 
of information in a general state. This 'invariant information' Itotai W1 h be useful when we in- 
vestigate the so-called quantum random access codes in chapter Before defining the invariant 
information ItotaU let us review the Shannon and von Neumann entropies, and propose a quan- 
tum measurement scenario for which they are not the most adequate measure of information. 



2.5.1 Shannon and von Neumann entropies 

Let us start by considering a completely classical setting, before moving on to the quantum case. 
When we perform measurements on a classical system with d distinguishable states, we obtain 
probabilistic outcomes whose information content can be characterised by the following simple 
expression involving the Shannon entropy H(pi,p2, ...,Pd): 

d 

Is = log(d) - H( Pl ,p 2 , ...,p d ) = log(d) + Y,Pi lo s(Pi)- ( 2 - 23 ) 

i=l 

A completely predictable outcome corresponds to H = 0, and therefore to maximal information 
Is = log(d) bits. Intuitively, this is quantifying the number of bits of information that can 
be transmitted between two parties by preparing this system in one of its d distinguishable 
states. This would correspond to optimal use of the d distinguishable states for information 
transmission. 

There exist other less efficient ways of using the same classical system for information trans- 
mission, for example by preparing it in a statistical mixture of its distinguishable states. In 
this case H > and Is < log ((f); the less than optimal Is reflects the non-optimality of the 
preparation procedure. 

In classical physics all measurements commute and the information present in the state 
has a pre-defined, objective existence prior to and independently of the observation. Quantum 
systems are different in this respect. As we have seen in the previous sections, contextuality 
and non-locality in general prevent quantum systems from having experimental outcomes which 
are pre-defined independently on the choice of compatible observables being measured, even in 
space-like separated regions. This subtlety behind quantum measurements requires us to be 
careful when trying to quantify information encoded in quantum systems. 

When thinking about entropy or information for quantum systems, the first thing one could 
do would be to try to generalise the Shannon entropy to the quantum case. This is achieved 
by the von Neumann entropy S, which is simply the Shannon entropy of the eigenvalues of the 
d x d density matrix p. S(p) is given by 

S(p) = -Tr(p)log(p), (2.24) 

an expression which is invariant under change of basis. For an extensive discussion of the 
mathematical properties of S see the review article by Wehrl f206f. 



The von Neumann entropy S is central in many results in quantum information theory: 
channel capacities, entanglement dilution and concentration, teleportation, and so on. It appears 
in the simplest possible quantum information application: how to send information from Alice 
to Bob by using mixed states pi with respective probabilities pi. The von Neumann entropy of 
this 'alphabet' of states S(p = ^^PiPi) is an upper bound on the amount of i nform ation per 



message that Alice can send Bob. A tight bound was obtained by Holevo (see [106] and [ 151 , 
chapter 12]). 

Despite its importance and interest, we cannot expect the von Neumann or Shannon entropies 
to be the most adequate measures of information in all quantum information applications. Let us 
imagine a situation where a single quantum system can be measured with either of two projectors 
Pi or P2, which do not commute. We can think of preparing quantum states such that Pi's 
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measurement outcomes have maximum entropy, and P^'s minimum, or some balance of the two. 
In this context, the invariant information of Brukner and Zeilinger offers an interesting way to 
put bounds on the entropies of alternative, and mutually exclusive, measurements performed on 
a single system, as we will see next. 



2.5.2 Defining invariant quantum information 

Brukner and Zeilinger's approach to quantifying quantum information started from an analysis 
of the axioms used by Shannon for deriving what became known as Shannon entropy. They 
then proceeded to a criticism of the applicability of these axioms in the context of quantum 
measurements [p7tl , which motivated their proposed measure of quantum information. 

Here I shall motivate Brukner and Zeilinger's invariant information in a somewhat different, 
and simpler, way than their original proposal. Instead of considering quantum information in 
its generality and criticising the use of Shannon's entropy, I will concentrate on a simple quan- 
tum measurement scenario and look for an expression which properly describes the information 
gathered in it. 

Suppose that we are merely trying to quantify the amount of information that can be gathered 
in non-commuting measurements of a single quantum system. Because of the non-commutativity, 
we know that the measurement of a single system necessarily 'projects' it into one eigenvector 
of the first observable, ruining our possibility of obtaining further information through measure- 
ment of other observables. Nevertheless, there should be a balance between the information that 
can be gathered using alternative, exclusive choices of projective measurements. Our aim then 
is to describe quantitatively this balance between alternative measurement choices. 

To simplify things, let us keep to the most symmetric possible sets of observables, which are 
projections on mutually unbiased bases (MUB's). These are bases such that preparation of a 
quantum state along any basis vector of basis i yields equally probable outcomes for a projection 



on basis j ^ i. For dimensions d which are powers of primes it has been shown [114, |213| , 212 
that it is possible to define d + 1 MUB's. In the qubit case, the three MUB's can consist, for 
example, of the eigenvectors of projectors P x , P y and P z along the three Cartesian axes in the 
Bloch sphere. 

To quantify the amount of information, or the randomness of the resulting outcomes, one 
immediately thinks of using the Shannon entropy H of the measurement outcomes. For the 
example of one qubit, we may define the total entropy of these outcomes by the sum of the three 
Shannon entropy-based Is [eq. (|2.23| )1 associated with the outcomes of the three projectors: 

jtotal = 3 _ jj^pj _ R ^p y) _ H (p z y (2>25) 

For a qubit prepared in an eigenstate of P z , for example, p" 1 "- 1 = 1. We expect that a useful 
measure of information in this simple case should be invariant under unitaries. It is easy to 
see, however, that I t ° tal does not satisfy this simple criterion: it depends on the pure state 
being used, and it takes different values depending on the choices of MUB's. This suggests 
that defining information for such alternative settings by the sum of Shannon entropies of the 
outcomes of alternative measurements may not be a good idea after all. How can we understand 
this unsuitability of Shannon entropy in this case? 

The sum of Shannon entropies is adequate for describing the entropies of independent events, 
which exactly for being independent have additive entropies. The non-commutativity of the 
measurements we are considering imply they are not independent, and for this reason the sum 
of the Shannon entropies is not adequate. Using Shannon entropies in this case amounts to 
assuming that projections on the three MUB's have pre-determined, independent outcomes, as 
would happen with three hidden dice in three different boxes, only one of which can be opened. 

As we have seen earlier in this chapter, such previously existing values cannot even in prin- 
ciple be assigned to outcomes of non-commuting quantum measurements. Thus, in order to go 
ahead with the idea of quantifying the amount of information available through measurements 
on MUB's, we need to find a more suitable entropy function. 

Shannon's Y27=i Pi l°S(Pi) entropy formula has the virtue of being additive for independent 
systems. As argued above, this is not an appropriate assumption in the case of alternative 



CHAPTER 2. ON THE FOUNDATIONS OF QUANTUM THEORY 



19 



measurements on a single quantum system. Instead, we would like to have a sub-additive entropy, 
to reflect the fact that such projections over MUB's are not independent: a large information 
(low entropy) of outcomes of one projector necessarily implies a reduced information (larger 
entropy) associated with other non-commuting projectors. 

While the general functional form of the Shannon and von Neumann entropies is the only 
one that is additive, other entropic functions have been proposed without this property. These 
include 

a-l 

(2.26) 
(2.27) 
(2.28) 

which can be defined for a wide range of the parameter a. These were proposed respectively by 
Hardy, Littlewood, and Polya f9§; Tsallis p2| ; and Renyi p5fl 

The invariant information introduced 
additive entropy 




ay Brukner and Zeilinger in [BQ, uses the sub- 



n — 1 n ( 1\ 2 

U( Pl , P2 ,...,Pn) = - S(W--J > ( 2 - 29 ) 

n \ n J 

i=l ' 



which is quadratic in the probabilities and similar to the entropies (2.26)-( p.28| ) above. The 



factor 1/n is introduced as this is the probability associated with a uniformly random variable 
with n outcomes. Intuitively, this entropy function measures a sort of 'distance' from a proba- 
bility distribution to the maximally random one. I introduced the additive term here for 
convenience: this way the entropy is zero when one of the probabilities pi = 1, with all the other 
Pj's being equal to zero. 

We will see now that this non-additive entropy can be used to define an information measure 
with the desired property, i.e. invariance under unitaries. Let us define the information / of an 
n-outcome measurement as: 

I(pi,P2, ...,Pn) = - U(px,p2, ...,p n ) = E [Pi J • (2.30) 

i=i x ' 

As defined above, the information takes minimum value zero when all outcomes have equal 
probabilities p, = 1/n, and a maximum value of (n — l)/n when one of the probabilities is one, 
and all the others zero. 

We can use this definition of information to characterise the total information content of 
outcomes of projections on a set of d + 1 alternative MUB's that exist for a (i-dimensional 
system. For that, we just add the information obtained for d-outcome measurements on d + 1 
different MUB's. This constitutes the quantum invariant information Itotal of Brukner and 
Zeilinger: 

d+i d+i d , -.2 

hotai =£ j (pU.-.p£) = EE U -i) • ( 2 - 31 ) 

3=1 j=l i=l V aJ 

Using the properties of MUB's it is possible to evaluate eq. ( |2.3lD explicitly: 

hotai = Tr(p 2 ) - 1/d (2.32) 

(see pl| , chapter 2] for details). Now we see that, unlike the information measure using the 
Shannon entropy [eq. ( |2.25| )], Itotal is invariant under unitaries, depending only on the purity 



CHAPTER 2. ON THE FOUNDATIONS OF QUANTUM THEORY 



20 



of the system, as measured by Tr(p 2 ). Note that Itotai assumes a minimum value of zero for 
maximally mixed states p = and a maximal value of for a pure d-dimensional state. 

Itotai quantifies in a neat way the balance between information that can be gathered through 
different measurements that do not commute. It also provides an inequality-free version of 



Heisenberg's principle (see [75, |94j). More generally, there have been many uses of non-additive 
entropic functions, for example to analyse decoherence |202H and to quantify entanglement [183]. 

These results suggest that in at least some applications, it may be more convenient to consider 
non-additive forms of entropy for characterisation of quantum information. In chapter |7] we will 
see an example of this, where Itotai naturally provides us with means to quantify the advantage 
of using quantum systems instead of classical systems in a simple information transmission task. 



Chapter 3 

Quantifying entanglement 



When we use non-locality in a number of quantum information applications, it is natural to 
see it as a resource to be consumed and manipulated in different ways. This view led to many 
attempts to better understand and mathematically characterise entanglement in all its forms 
and types. 

Today quantum entanglement represents a wide research subject which has been flourishing 
vigorously in the last few years. In this chapter I will make no attempt to describe all the 
main results in the area. Instead, I will just give a bird's eye view of the subject of quantifying 
entanglement, mostly referring to work which will be useful later on. This will be particularly 
handy in chapter [|, where I describe some of my work on tripartite pure-state entanglement. 

We start in section |3.1| by distinguishing separable and entangled states, and reviewing some 
of the known results on separability criteria. In section 3.2 I introduce the two main approaches 
towards quantifying entanglement developed in the literature. These involve either discussing 
entanglement in single states, or considering the asymptotic limit of manipulations on large 
number of copies. Some of the main results of these two approaches are presented in sections 3.3 
and 3.4. Finally, in section 3.5 I introduce the quantum relative entropy, which can be used to 
formulate a measure of entanglement that will be useful when we discuss tripartite entanglement 
in chapter [|. 



3.1 Entangled versus separable states 

From a mathematical point of view, an iV-partite pure quantum state is defined as being sepa- 
rable when it can be written as a product of states of the individual parts: 

W)se P =\<t>)A\<f>) B ---\<f>) N , (3-1) 

where \<f))- represents a pure state belonging to party j. Similarly, a iV-partite mixed state p is 
separable when it can be represented as a convex combination of tensor products of (possibly 
mixed) states of its subsystems: 

Psep = (pA <S> Pb &> •• • <8> Pn) , = 1- (3.2) 

i i 

Note that any mixed separable state representable as in eq. fl3,2|) can also be put in the alter- 
native form 

Psep = ^2Pt\<Pi) (0iU l&) (<t>i\B-" l&) (&ljV> ^Pi = L ( 3 -3) 
i i 

Whenever the state is not separable, we say it is entangled. 

The definition above has a simple operational characterisation behind it: the subsystems 
of a separable state can be produced separately by the different parties, whereas this is not 
possible for entangled states. To produce the separable state ( |3.3D , each party j prepares state 
\4>i)j in a classically correlated fashion with the others. The states must be prepared according 
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to the probability distribution pi characterising the separable state p sep , which can be done if 
the parties previously share instructions on the appropriate sequence of states to prepare. This 
procedure is a perfectly valid way of creating the separable state p sep above. Entangled states, 
on the other hand, exhibit correlations among measurement outcomes which simply cannot be 
emulated by any such locally p repa red (i.e. separable) state. 

As we have seen in section |2.3| , measurements on entangled states can reveal non-locality. 
More precisely, for any bipartite pure state there exist measurements whose outcomes are incom- 
patible with any local hidden- variable theory (a result known as Gisin's theorem |$(], jS9[). In 
the case of multipartite states the situation is more complicated, but Popescu and Rohrlich have 
shown 159 ] that provided the parties can collaborate and use post-selection, then any multipar- 
tite pure entangled state violates some Bell- type inequality. The question of which ineq uality 
to use in order to reveal the non-locality of a given entangled state is hard; sec [220, 209, 55, |]] 
for some recent results. 

For mixed states, the relation between entanglement and non-locality is less well understood, 
and there are many open problems. In [207] Werner showed that there exist mixed states of 
two parties which are not separable, and yet admit a local hidden-variable model. If a state 
is distillable, then with some probability one can distill pure-state entanglement from it, and 
violate some Bell inequality by the results of |89|, |159|. However, as we will 



see soon, 



there exist bound entangled states which cannot be distilled by local operations and classical 
communication into standard pure entangled states. Some multipartite bound entangled states 
have been shown to violate Bell-type i nequ alities |73| , 59|, but it is not known whether this is 
the case for all bound-entangled stat es [180]. 

Unfortunately, the definition fl3.3j) of separable states gives no clue as to how to test a given 



density matrix for separability. A great deal of research has been done on the important problem 
of finding separability criteria. Let us briefly review one of the simplest yet strongest tests, the 
Peres-Horodecki criterion. 



3.1.1 The Peres-Horodecki separability criterion 

A remarkable early result was the Peres criterion, which is a powerful necessary criterion for 



bipartite separability [156]. Soon after its publicati on, the Horodecki family proved it was also 



sufficient for systems of dimension 2x2 and 2x3 [108], 

To motivate the criterion, let us consider the non-unitary operation of transposition. Despite 
being non-unitary, the transpose of a density matrix is also a density matrix. The transposition 
is an example of a mathematical operation that cannot be physically implemented, but which 
takes a density matrix a into another density matrix a T representing a physically possible state. 

Now let us consider what happens when we transpose the states corresponding only to 
subsystem A of a bipartite composite quantum system (tab- This partially transposed matrix p 
of a bipartite density matrix a is defined as 

Pm/i,nf — 0~n[i,rnvi (^•^) 

where the Latin indices representing subsystem A where transposed, while the Greek indices of 
subsystem B were not. In the case of separable states, this operation always results in a physical 
state. To see this, we can think of a simple way of preparing any separable state of two parties: 

Psep = ^Pi \4>i) {<t>i\A Ifa) (<folB>$JPi = L ( 3 - 5 ) 

i i 

One way to prepare this state is to issue instructions to Alice and Bob, so that they release the 
classically correlated states \4>i) a an d \4>i) b w hh the right probability distribution p^. This can 
always be done for separable states, because they are exactly those states admitting a description 
in terms of definite subsystem states. 

The partial transpose of separable state ( |3.5| ) is another separable state, in which each of 
the states describing Alice's subsystems are transposed. We thus see that partial transposition 
takes separable states into separable states, even though again it is not a physical operation that 
can be performed on Alice's system. Rather, it is a mathematical operation resulting in another 
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separable density matrix which can be physically prepared using classical correlations only, as 
described above. 

Peres noted, however, that when applied to entangled states, the partial transposition need 
not result in a physical state. This is related to the fact that entangled states do not admit a 
description in which each subsystem's state is specified without reference to other subsystems, 
as is the case with separable states. The partial transposition of entangled states can result 
in negative operators, i.e. operators with negative eigenvalues. This leads to Peres' necessary 
criterion for separability: 



Peres' separability criterion [156]: Positivity under partial transposition (PPT for short) 



is a necessary condition for separability. 

This separability test has the great advantage that it is computationally simple to perform. All 
that is needed is to sw ap s ome elements of the density matrix [corresponding to the partial 
transposition operation fl3,4|) [ , followed by a simple diagonalisation to find whether the resulting 
matrix has negative eigenvalues. If it does, the state is entangled. 

This raises the question of whether the condition is als o su fficient to guarantee separability. 
The Horodeckis answered this question in the negative [|108| ] by proving that it is sufficient 
only for systems of dimensionality 2x2 and 2x3. Interestingly, they showed that there exist 
bound entangled states, which despite being PPT (positive under partial transposition), are not 



separable. In [11C] it was shown that the entanglement of these states cannot be distilled into 
a useful form for protocols such as teleportation, and it was this property which motivated the 
term 'bound entanglement'. 

Since then, many other results on separability were obtained; for a review see Lewenstein et 
aVs article [135|. Some important criteria have been derived recently, and were not reviewed in 



|135||. These include the results reported in [167, 168] and ||68|, for example. 



3.2 Different approaches to quantifying entanglement 

When trying to quantify entanglement, it is important to specify which type of system and 
approach we are taking. Some useful distinctions among quantum systems are: bipartite versus 
multipartite states; general mixed states or pure states only; and systems described either by 
discrete or continuous variables, examples of which are a spin-1/2 particle and the electric field, 
respectively. In this chapter I will mention some results on all these areas, with the exception 
of continuous variables. 

An important question regards how to manipulate entanglement and convert it from one 
form to another, using only local operations and classical communication (LOCC). If we allow 
operations more general than these (for example by allowing quantum communication), it is 
obvious that we can create any amount of entanglement. Since it is exactly the non-locality 
present in entangled states that we want to quantify, we need to restrict ourselves to finding 
out what transformations are possible under LOCC only. Unfortunately, LOCC are a set of 
operations which is very hard to characterise mathematically, unlike simple unitaries for exam- 
ple. Nevertheless, many important results have been obtained, and they pertain to two main 
approaches which we need to specify. 

The first approach consists of investigating how to interconvert the entanglement present 
in single copies of entangled states. The second approach is inspired in classical information 
theory, and asks what is possible to do when the parties have access to an ensemble of states, 
i.e. an arbitrarily large number ./V of identical copies. Let us now briefly review some important 
results obtained in recent years along these two lines of research. 



3.3 Single-copy approach 

This approach investigates which transformations are possible with LOCC on single states. If 
necessary and sufficient conditions for this type of manipulation on mixed states can be found, 
this would be directly relevant to experiments involving small number of qubits. On the other 
hand, there are clues that the overall picture of entanglement simplifies only in the asymptotic 
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limit of large number of parties, much in the same way as it happens in the asymptotic limit 
of classical information theory. This is suggested by some results I will describe in this and the 
next section. We will see that, in the single copy approach, the number of entanglement classes 
increases rapidly with the number of parties. Moreover, in the bipartite pure-state case a single 
number can be shown to encapsulate the entanglement properties of any given state, but only 
in the asymptotic limit of many copies. 

A key result in the single-copy regime is Nielsen's theorem [p.50[ , which establishes a simple 
necessary and sufficient condition for interconvertibility of pure bipartite states under LOCC. In 
order to state it, first we need to introduce the concept of majorization. To do that, let us use 
the notation to indicate a re-ordered vector v, in such a way that the largest element comes 
first, the next-largest second, and so on. Then we can say that x = (x±,X2, ■ ■ ■ , Xd) is majorized 

by y = (yi,V2, ■ ■ • , Vd) (written x -< y) if Y^=i x j ^ Ej=i Vj for fc = 1, 2, . . . , d, with equality 
when k = d. 

Now, let us define \ p to be the vector of eigenvalues of density matrix p. We are now in 
position to state Nielsen's theorem: 

Theorem (Nielsen Jl50| ) A bipartite pure state \tp) can be tranformed by LOCC to state \cj>) 
if and only if Aw, < 

This theorem basically closes the interconvertibility problem for exact transformations of bipar- 
tite pure states. Soon after it was published, Hardy found an alternative geometrical proof of 
the theorem, and obtained the optimal strategy for distilling maximally entangled states from 
any bipartite pure state |99|. Jonathan and Plenio derived the same result independently in 



|116| , 117] using other methods, and also worked out other applications of Nielsen's theorem. 

The theorem above only deals with exact transformations. Where an exact transformation 
is not possible, we can always try to perform an optimal, approximate transformation instead. 
This will be necessary in most cases, as interconvertibility is not guaranteed in general. For 
example, \(p) cannot be converted into l^) whenever the latter has a larger Schmidt number 
than the former. 

It is thus important to address the question of approximate transformations. More precisely, 
starting with state we may want to find, the closest approximation \(f>approx 

) of target state 



that can be reached. In [201] Vidal et al. obtained this optimal approximate conversion strategy 
for pure states, when we quantify the distance between |0) and \4> a pprox) by their mutual fidelity. 

When we leave the pure states and consider mixed states, or even pure states of more 
parties, the problem of interconvertibility becomes harder. Even for mixed states of two parties, 
no necessary and sufficient criterion was found to date. Some partial results are known: for 
example, Vidal introduced entanglement monotones, which are functions that cannot decrease 



under LOCC, for pure or mixed states of two or more parties [196]. These monotones can be 



used to formulate necessary criteria for a LOCC transformation to be possible. 

3.3.1 Multipartite entanglement 

An interesting result c once rns the number of different types of pure multi-party entanglement. 



It has been shown in 197 , 16] that two pure multipartite states and |0) can be obtained 



deterministically from each other by LOCC if and only if they are related by local unitaries. 



Since even in the bipartite case this is not usually the case [197, 136, f5J, if we define 'different 
types of entanglement' by mutual deterministic interconvertibility we will be stuck with disjoint 
families of different types of entanglement parametrised by a continuous parameter. 

This motivated Diir, Vidal and Cirac to relax somewhat this interconvertibility criterion, 
grouping entangled states in the same class if they can be interconverted by LOCC with any 
non-zero probability, instead of deterministically. This makes sense, as states in the same class 
can be used for the same applications, albeit with different probabilities of success. 
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Using their newly defined classifying criterion for multipartite entanglement, Diir, Vidal 
and Cirac were able to show that there are two genuinely different kinds of tripartite entangle- 
ment for three-qubit states [74]. The generic family consists of states which can be converted 
stochastically into GHZ states 



|GHZ) = — (|000) + |111)). 

Remarkably, there is another family of states with tripartite entanglement which cannot be 
converted into GHZ states, and therefore belong to a different class. A representative of this 
class is the so-called W stateQ 

|W) = -^ (|001) + |010) + |100)). (3.6) 

This state has an interesting entanglement structure which we will explore further in chapter [|. 

Following this approach, in [195] the nine different classes for a four-qubit system were worked 
out, the generic class being again the GHZ state of four qubits. More generally, in [74] it was 
shown that the number of different entanglement classes increases rapidly with the number of 
parties. 



3.4 Asymptotic limit approach 

A second approach to quantification of entanglement takes its inspiration from classical infor- 
mation theory, and considers the transformations under LOCC on an arbitrarily large number 
n of identical copies of the same state. It turns out that we can use a single number to quantify 
the amount of entanglement of any pure bipartite state in this asymptotic limit. 

In order to discuss this problem and introduce multipartite entanglement, first we need to re- 
view concepts such as asymptotic equivalence and Minimal Reversible Entanglement Generating 
Sets (MREGS). 



3.4.1 Asymptotic equivalence 

Let us now present the idea of asymptotic reducibility between two multipartite pure states, 
discussed by Bennett et al. in |fTq|. Let us denote by F the fidelity between two states \ip) and 



(3.7) 

Also, let us use C to denote any locally implementable super-operator, i.e. any manipulation of 
the quantum state which can be performed by using LOCC only. Then state \tp) is said to be 
asymptotically reducible to state |</>) if and only if 

V5>0,e>0 3n,n',£ (3.8) 
\{n/n')-l\<5, F(£(V 0n '),0 0n ) > 1-e, (3.9) 

In words: \if)) is said to be asymptotically reducible to state [</>) if there exists a LOCC operation 
that takes n' copies of into an arbitrarily good approximation of n copies of |</>), for large 
enough n' ,n. 

With the concept of asymptotic reducibility, we can define asymptotic equivalence. States 
\ip) and \4>) are asymptotically equivalent if and only if \tp) is asymptotically reducible to \<p), 
and vice-versa. 

To illustrate the usefulness of these concepts, let us re-state the results of fll| using them. 



In p5 |, Bennett et al. showed that all bipartite pure states are asymptotically equivalent to a 



pair of qubits in a Bell state, for example 



|^-) = -^(|0 a 0b>-|1a1b». (3+0) 



1 Named after Wolfgang Diir, one of the authors of |74|. 
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Moreover, it was shown that a simple function quantifies this reducibility relation. To present 
this result, let us first note that asymptotically reversible transformations can have non- integer 
yields. More precisely, we say \il))® x is asymptotically equivalent to \cf))® v (with x, y real numbers) 
if and only if 



V5>0,e>0 3n,n',£ (3.11) 
\{n/n')-x/y\<6, F(C(^ n '),^ n )>l-e. (3.12) 

Now we can consider asymptotic transformations of copies of a less-than-maximally entangled 
state | rj) and maximally entangled states |VO : 

\ V )®* L( S C \rf y ■ (3.13) 

Bennett et al. showed that this transformation is possible in the asymptotic limit, with the 
asymptotic yield x/y approaching S(pa) = S(ps), the von Neumann entropy of the reduced 
density matrices of state \rj). 

Transformations from EPR pairs to less entangled states are referred to as entanglement 
dilution, and the reverse is called entanglement concentration. As we discussed above, the same 
problem can be discussed in the finite-copy approach, but in that case the transformations are 
in general irreversible. 

Based on this result, we can say that in the asymptotic limit there is only one type of bipartite 
pure-state entanglement, and this is conveniently represented by an EPR pair of qubits. This 
entanglement can be manipulated reversibly in the asymptotic limit, which is a very nice result 
and gives support to the idea that entanglement is a resource to be quantified and manipulated 
for different quantum information processing protocols. 



3.4.1.1 Mixed state entanglement 

When we consider entanglement manipulations with mixed states, the situation becomes in- 
triguing even for bipartite states. In the single-copy case, the Horodeckis showed [ |110| , |109| ] 
that there existed entangled states from which no entanglement could be distilled. In bipartite 
systems, these peculiar states exist only in dimensions 3x3 and higher. They were named 
bound entangled states, because the entanglement in them seems not to be in a 'free form', as 



it cannot be distilled into a form useful for a task such as teleportation. Surprisingly, in [147] 
a quantum information task was shown to have a better performance when using a tripartite 
bound-entangled state, than would be possible with classically correlated states. The properties 
and possible uses of bound entangled states are yet not clea r; a good reference discussing this 
problem and separability is the review by Lewenstein et al. [|l35f |. 

Because they are not separable, some entanglement needs to be invested to create bound 
entangled states, even though none can be distilled in the single-copy regime. Although it 
seemed unlikely, the possibility remained that in the asymptotic limit the required entanglement 
to create these states would tend to zero, and the distillation/concentration processes would 
recover the reversibility of the pure-state case. It was only through the recent work of Vidal 
and Cirac |198| , [199(1 that the question was definitely settled in the asymptotic regime. They 
showed that also in the asymptotic regime a non-vanishing amount of entanglement is required 
to create some entangled states, while no entanglement can be distilled from them. 



3.4.2 Minimal Reversible Entanglement Generating Sets (MREGS) 

In reference ju]] the concept of Minimal Reversible Entanglement Generating Sets (MREGS for 
short) was introduced as a natural way of tackling the pure-state entanglement problem in a 
multipartite setting. This concept sets a framework which I use in chapter [5] to explore tripartite 
pure-state entanglement in the asymptotic limit. 
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Definition: A Reversible Entanglement Generating Set (REGS) for iV-partite pure states is a 
set G with j pure iV-partite states G = {\4>i) , \ 4>2) , ■ ■ ■ , \ <Pj)} having the property that any 
iV-partite pure entangled state is asymptotically equivalent to tenso r prod ucts of states in 
G (the definition of asymptotic equivalence can be found in section |3,4.1 ). 



Definition: A Minimal Reversible Entanglement Generating Set (MREGS) is a REGS with 
smallest possible cardinality. 



In section 3.4.1 above we have reviewed the results of [15], which can be conveniently sum 



marised using the concept of an MREGS for bipartite states. Their fundamental result was 
to prove that any single bipartite pure entangled state is an MREGS for two-party entangle- 
ment. The most nat ural s uch MREGS for bipartite pure states consists of a single EPR pair, 
for example in state ( |3.10| ). 

The idea of an MREGS is not useful only for bipartite states; it provides a solid starting point 
for addressing important questions regarding multipartite entanglement. Among them, we can 
ask: is there a finite MREGS for multipartite entanglement manipulations? Which states must 
be present in such an MREGS? In chapter [5] I describe my work on this problem for tripartite 
states. 

A useful measure of entanglement in a bipartite or multipartite setting is the relative entropy 
of entanglement, which we will use in chapter || In the next section we motivate its definition 
and review some of its properties. 



3.5 Quantum relative entropy 

In this section we will briefly review some properties of a very useful measure of entanglement 



called relative entropy of entanglement [193, 192]. It can be applied either in the finite-copy case, 



or in the asymptotic case. What is more interesting, however, is that it is based on a well-founded 
idea, that of quantifying distinguishability between entangled and separable states. Before we 
discuss the quantum version, let us see how relative entropy arises in classical information theory 
as a measure of distinguishability of probability distributions. 

For an accessible introduction to relative entropy in classical information theory see the 
book by Cover and Thomas |56| . As for quantum relative entropy, its many uses in quantum 
information theory have been reviewed recently by Vedral in [|190|[. 



3.5.1 Classical relative entropy 

In many situations we need to compare two probability distributions pi and and quantify 
how different they are. The relative entropy D(p\\q) was first defined by Kullback and Leibler 



[124], and quantifies this 'distance' between two probability distributions: 



with the convention that Olog - = and plog ^ = oo. 

D(p\\q) has many properties: it is always non-negative; it is zero only when p = q; and it is 
not symmetric. This last point may be surprising, as we have mentioned it can be interpreted as 
a 'distance', and a properly defined mathematical distance must have the symmetry property. 
In order to perceive intuitively the necessity for this asymmetry, it is enough to consider the 
problem of distinguishing between tosses of a fair coin and those of a coin so loaded it always 
comes out Tails. If we do not know which coin we have, the assumption that the coin is loaded 
can be dismissed completely after just a few tosses, whereas the assumption that it is fair (when 
in fact it is not) always has a finite probability 2~ N of being upheld after N tosses. This simple 
example shows that D(p\\q) must be asymmetric, as it depends on which probability distribution 
we expect (q), and which is actually obtained (p). 

The reason why the relative entropy D(p\\q) is unique is because it quantifies how hard it is 
to distinguish two probability distributions, in a very operational sense. It follows from Sanov's 
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theorem [169] that the probability of mistaking events happening according to probabilities p for 
events that would arise from probability distribution q decreases exponentially with the number 
of trials N, times the relative entropy D(p\\q): 

^(taking p for q) = 2~ ND ^\ (3.15) 

which is valid for large N. In our simple example we toss a loaded coin {p\ = 0,p2 = 1) and 
ask the question: is it possibly a fair one (gi = q% = 1/2)? After N tosses we can be sure it is 
loaded with probability roughly 1 — ^(taking p for q) = 1 — 2~ ND ^ q ^ = 1 — 2~ N . The relative 
entropy D(p\\q) = 1 appears in the exponent of this small depart from certainty, to quantify the 
probability of error in distinguishing p and q. 

If we still keep the coin example, let us see what happens when we reverse the situation, 
tossing a fair coin and asking whether it may be a loaded coin always yielding Tails. In this 
case, a single Heads result gives us complete certainty that the coin is not loaded, and thus the 



exponential approach to certainty (given by eq. 3.15) is not valid anymore. This is reflected by 



the fact that D{q\\p) = oo, as can be easily checked. 
3.5.2 Relative entropy of entanglement 

The relative entropy has a straightforward quantum generalisation. We define the von Neumann 
relative entropy between two density matrices a and p by: 

D(a\\p) = Tra(ioga- logp). (3.16) 

Hiai and Petz |105j ] have shown that D(a\ \p) has the same statistical interpretation as its classical 
analogue, indicating how hard it is to distinguish measurement outcomes on state a from our 
expectations of outcomes from state p. 

More recently, Vedral and Plenio 11931, fL92|] have suggested the use of the quantum relative 



entropy to quantify entanglement. They defined the relative entropy of entanglement E(cr) to 
be: 

E(a)=mmD(a\\p), (3.17) 

where S is the set of separable states. The motivation for this definition is, as argued above, 
its interpretation as a measure of distinguishability between quantum states. By taking the 
minimum over the set of separable states, we are finding the separable state p^ n whose mea- 
surement statistics most closely resemble that of an entangled state a, and quantifying the 
distinguishability between a and this closest separable state p^f n - 

The relative entropy of entanglement was also proved to satisfy four basic requirements for 
entanglement measures: it is zero for separable states; it is invariant under local unitaries; it 
cannot be increased by local operations and classical communication; and it reduces to the von 



Neumann entropy S(a) for pure states a |192|, 19C]. 



One property which may be desirable from measures of entanglement is additivity. In order 
to define additivity of the relative entropy of entanglement E, we need to discuss how it can be 
applied to iV copies of a given state, or even in the asymptotic limit N —* oo. To discuss this 
limit, we can define a regularised relative entropy of entanglement E re9 by 

E re9 (a AB ) = lim n _ 00 (l/n) min D (a%\\p AB ) . (3.18) 

PAB&S 

If E reg (p) = E(p) we say E is asymptotically additive for state p; if E{p\®p2) = E(p\)-\-E(p2) 
V/9i,P2 then we would say that E is a fully additive measure. 

Until very recently the relative entropy of entanglement E was conjectured to be fully addi- 
tive, as numerical investigations had failed to identify sub-additivity. However, Vollbrecht and 
Werner found a counter-example to the full additivity conjecture |203| ] for two subsystems of 
dimension d > 3 each. This was a surprising result, bearing on the work I report in chapter [|. 



Chapter 4 

Quantum information applications 



In the previous chapters we have reviewed some general properties of quantum systems, such as 
contextuality and non-locality. We have also sketched some different approaches to the problem 
of entanglement characterisation. In this chapter we will review three quantum information ap- 
plications which draw on these quantum characteristics for their better-than-classical efficiency 
A great number of quantum information applications have been developed in the last few 
years. I will make no attempt to review the very many ideas about how to use quantum phenom- 
ena for information processing. Instead, I will concentrate on describing the main features of 



just three applications: quantum communication complexity protocols in section 4.1; quantum 
random access codes in section [4.2|; and quantum cloning in section |~~ 



In later chapters I will investigate many characteristics of these three applications, with an 
emphasis on finding out why they are better than their classical counterparts, and what they can 
tell us about quantum theory. I will also discuss the feasibility of experimentally implementing 
some instances of the first two applications. 

Let us start by discussing communication complexity problems, which illustrate in a clear 
way the communication requirements for a type of distributed computation. 



4.1 Communication complexity 

The notion of communication complexity was introduced by Yao |216j ], who investigated the 
following problem involving two separated parties (Alice and Bob). Alice receives a n-bit string 
x and Bob another n-bit string y. They are then allowed to perform local computations and 
to exchange some communication, in such a way that one of them (say Bob) can announce 
the value of a function f(x,y). The resource we will try to minimise here is the amount of 
communication between them, hence the name 'communication complexity'. Note that here we 
are not concerned about the number of computational steps, or the size of the computer memory 
used. Communication complexity tries to quantify the amount of communication required for 
such distributed computations. 

Of course they can always succeed by having Alice send her whole n-bit string to Bob, who 
then computes the function, but the idea here is to find clever ways of calculating / with less 
than n bits of communication. 

This abstract problem is relevant in many contexts: in Very Large Scale Integrated (VLSI) 
circuit design, for example, one wants to minimise energy use by decreasing the amount of 
electric signals required between the different components during a distributed computation. 
The problem is also relevant for the study of data structures, and in the optimisation of computer 



networks. For a survey of the field, see the book by Kushilevitz and Nisan [126]. 

This simple two-party scenario can be generalised in many different ways. We can think 
of more parties involved in the distributed computation; consider deterministic or probabilistic 
protocols; or allow for quantum phenomena to be used. This last possibility generalises com- 
munication complexity to the quantum realm, where we can use quantum communication or 
entanglement. 
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4.1.1 Quantum communication complexity 

Quantum communication can be more efficient than classical communication for some tasks. 
Dense coding [^], for example, is a process in which one previously shared pair of maximally 
entangled states allows one party to communicate two bits to a second one by sending only a 
single qubit. This 'two bits per qubit' figure is, however, misleading, as an entang led qubit had 
to be shared between the parties beforehand. More generally, Holevo has shown |106| 1 that the 
mutual information between two unentangled parties cannot increase by more than one bit per 
qubit of quantum communication between them. This is still true when the two parties can 



exchange qubits both ways |149| , p3|. 



So even if we allow for previously shared entanglement, quantum communication cannot be 
more than twice more efficient than classical communication at sending information between two 
parties. It may come as a surprise, then, that for distributed computations quantum communi- 
cation and entanglement can result in even exponential gains over optimal classical protocols. 
Quantum communication complexity tries to quantify the communication reduction possible by 
using quantum effects during a distributed computation. 

At least three quantum generalisations of communication complexity have been proposed; for 
a survey see Brassard's article The first one is the qubit- communication model introduced 
by Yao [|217[ and Kremer [123 1 . In this model the communication is performed by exchanging 



qubits between the parties, instead of classical bits. In chapters [7| and |8| I will analyse some simple 
communication complexity tasks which can be better performed with quantum communication 
than with classical communication. In chapter |8| we will see that the quantum advantage can 
be so pronounced that there exist feasible schemes to experimentally demonstrate it even with 
very low quantum detection efficiencies. 

In the second model the communication is still performed with classical bits, but the parties 
are allowed to manipulate an unlimited supply of quantum entangled states as part of their 
protocols. This is the entanglement-based model int r oduc ed by Cleve and Buhrman ]5^1 , and 
further investigated by various authors |3^, |164[ , |102| . In chapter |7| and || I will analyse 
some simple entanglement-based quantum protocols, and show how they relate to quantum 
non-locality and contextuality tests. We will also see that such protocols naturally give us 
some bounds on the detection efficiencies and background rates necessary for experimentally 
establishing these quantum characteristics. 

The third model involves access to previously shared entanglement in addition to qubit 
communication, and is the least explored of the three quantum models. 

Quantum communication complexity protocols feature prominently in this thesis, appearing 
in chapters [?] and |8[ They also served as an inspiration for the work I report in chapter ||. 



4.2 Quantum random access codes (QRAC's) 

There are many ways in which quantum communication can be superior to classical communi- 
cation. Imagine a situation in which Alice encodes m classical bits into n bits {m > n), which 
she sends to Bob, who will need to know that value of a single bit (out of the m possible ones) 
with a probability of at least p. We may represent such an encoding/decoding scheme by the 

notation: m n. 

Prior to sending the n-bit message, however, Alice does not know which of the m bits Bob 
will need to read out. To maximise the least probability of success p, Alice and Bob need to 
agree on the use of a particular, efficient m — > n encoding. 

We can consider the quantum generalisation of this situation, in which Alice can send Bob 
n qubits of communication, instead of n bits. The idea behind these so-called quantum random 
access codes is very old by quantum information standards; it a lready appeared in a paper 
written circa 1970 and published in 1983 by Stephen Wiesner |21C|1 . 

These codes were re-discovered in [H, where the explicit comparison with classical codes 
was made. One way to think about the possibility of a quantum advantage is to think of the 
(generally) non-commuting measurements we can use in the decoding process. Because of the 
non-commutativity, each particular measurement disturbs the system, destroying some of the 
information that could have been revealed by another choice of measurement. Thus, we see that 
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the constraints on what can be achieved by a single measurement on a quantum system of n 
qubits are less severe than in the classical case, possibly allowing for a higher quantum encoding 
efficiency. 

In chapter [7| we will investigate the role played by quantum contextuality and non-locality in 
these quantum random access codes. We will also see how the invariant information of Brukner 
and Zeilinger helps us to bounds the performance of more general QRAC's. 

4.2.1 A simple 2 -> 1 QRAC 

Let us illustrate the concept with the simplest such scheme, which encodes Alice's two-bit string 
bob\ into a single qubit's states (a 2— >1 quantum random access code). Before we discuss the 

quantum code, let us find out what the optimal probability of success is for classical 2 — ► 1 
codes. 

4.2.1.1 Optimal classical protocol 

Alice and Bob need to decide on a protocol defining which bit-valued message is to be sent by 
Alice, for each of the four possible values of her two-bit string x. Each of these 2 4 = 16 different 
deterministic protocols proti has a probability of success p c (proti) which can be evaluated in a 
straightforward way. The optimal deterministic classical protocols can then be shown to have a 
probability of success p c = 3/4. 

Now we need to address the possibility of probabilistic protocols. Any probabilistic protocol 
can be represented as a convex combination of the 16 deterministic protocols: J2]=i(liP r °ti)' 
with weights qi satisfying Yll=i 1i = 1- The corresponding probability of success p c for any such 
probabilistic protocol will be given by the weighted sum of the probabilities of success of the 
individual deterministic protocols: p c = Yll=i [liPc(P r °U)]- This makes it clear that the optimal 
probabilistic protocols can at best be as efficient as the optimal deterministic protocol, which 
we have already shown can have a probability of success of at most p c = 3/4. 

This is a much simpler proof than the one in the original paper by Ambainis et al. ||. Note 
that there, the definition of the probability of success p is slightly different: they define p as 
the least probability of success, whereas I define p as the average probability of success for a 
uniformly random distribution of Alice's bit-strings. 



4.2.1.2 Quantum protocol 



l^o) = \^/2,n/4)) , 

\<Poi) = |^(tt/2, 7tt/4)> , 

\<ho) = ^(vr/2,3vr/4)), 

\<j>u) = |^(tt/2, 5tt/4)> . 



(4.1) 
(4.2) 
(4.3) 
(4.4) 



Ambainis et al. also showed that there is a quantum 2 1 encoding with p = cos 2 (7r/8) ~ 
0.85. It works as follows: depending on her two-bit string &o^ij Alice prepares one of the four 
states \(f>b bi)- These states are chosen to be on the equator of the Bloch sphere, separated by 
equal angles of it/ 2 radians (see figure |4~l| ) . 

Using the usual Bloch sphere parametrisation 

|V(0, <t>)) = cos(|) |0> + exp(^) sin(|) |1) , 

we can represent the encoding states depicted in figure |4.1| as: 



Bob's decoding measurements will depend on which bit he wants to obtain. To decode bo, 
he projects the qubit along the x-axis, and to read &i he projects it along the y-axis. The then 
associates a result '0' if the measurement outcome was along the positive direction of the axis 
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Figure 4.1: Qubit states used in the 2 — » 1 QRAC. Alice prepares the qubit in one of the four 
states pictured above, which lie on the equator of the Bloch sphere. Bob's decoding procedure 
will consist of projecting either on the x-axis or the y-axis, depending on which of Alice's bits 
he wants to read. 



measured, and '1' otherwise. It is easy to check that the probability of successfully retrieving 
the correct bit value is the same in all cases: 

p q = cos 2 (vr/8) ~ 0.85, 

which is higher than the optimal probability of success p c = 0.75 of the classical random access 
code using a bit of communication. 

In chapter |7] I will analyse this simple quantum random access code and show that its higher- 
than-classical efficiency arises from quantum contextuality an d no n-locality. Using Brukner and 
Zeilinger's invariant information (see section ^^), in section \[.5\ I will derive some bounds on 
the probability of success of more general codes. 

We will also see that the simplest quantum random access codes involve the same quantum 
operations as other quantum information applications; this does not seem to have been noticed 
before, and helps in understanding the fundamental basis for the quantum advantage in a number 
of similar applications. 



4.3 Quantum cloning 

One of the first results stressing the peculiarities o f qu antum information was the no-cloning 



theorem obtained in 1982 by Wootters and Zurek [214] and independently by Dieks [p7|| . In 



simple terms, they showed that unlike classical information, an arbitrary quantum state cannot 



in general be perfectly copied. In section [4.3.1| we will review a simple version of this important 
theorem, and briefly discuss its relevance. 

From a computational point of view, the no-cloning theorem exemplifies one of the funda- 
mental differences between quantum and classical information processing. A classical memory 
register can be read out and operated upon at any point during a computation; in particular, 
it is always possible, and indeed necessary in many cases, to perform the COPY operation on 
a subset of the registers. A quantum register, however, can encode a large amount of classical 
information (see chapter which cannot in general be read out and copied during the unitary 
evolution that characterises quantum algorithms. 

It may be impossible to make perfect copies of quantum states, but approximate copies are 
not ruled out by the no-cloning theorem. This idea led Buzek and Hillery to develop the so- 
called universal quantum cloning machines (UQCM's for short) |^(|, which are unitary quantum 
operations resulting in good-quality (but not perfect) copies of a given quantum state. Since 
then many authors have further investigated the limitations that quantum mechanics imposes 
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on un itary distribution of quantum information. I will describe their main findings in section 
4.3.3 . 

Another strategy for quantum cloning is to use a procedure combining unitary evolution and 
measurement to obtain perfect clones with some probability p < 1. This approach was developed 



by Duan and Guo [71, ^] in 1998. In section 4.3.5 I briefly describe these probabilistic cloning 
machines. 

These two cloning approaches have something in common: they are necessarily imperfect. 
The limitations of UQCM's show up in the noise present in the imperfect clones created. In the 
case of probabilistic cloning, the problem is that the process only works with probability p < 1. 

In chapter |6| I will present my work on the use of cloning for quantum computation. We 
will see that despite their imperfections, during quantum computations cloning may offer an 
advantage over approaches that do not resort to quantum information distribution. For example, 
we will see that in some cases the noise introduced by UQCM's represents a lesser problem than 
the irreversible loss of quantum information represented by alternatives involving extraction of 
classical information through measurement. 

Despite its importance from a theoretical point of view, until recently very few, if any, 
applications of cloning procedures had been found. Distributing information during quantum 
computations is an application that stresses the fundamental differences between quantum and 
classical information. 



4.3.1 The no-cloning theorem 

To get the basic idea behind the no-cloning theorem [ |214| , |67| ], suppose we are given a qubit in 
an unknown pure quantum state \tp) and we are asked to make a perfect copy of it. We start 
with our original a qubit in a known pure state | blank), and an arbitrary ancillary system 
(the Cloning Machine) also in a known state, |Mo). Our objective is then to find a unitary 
transformation U such that 

C/(|V) ® |blank) |M » = |V>) ® |V>) ® |M^) . (4.5) 

The role of the ancilla is to give us more freedom in the unitaries which we can perform; we 
allow it to end up in a state |M^) which may depend on the original state If we want this 
unitary to work for any state, in particular it should work for two particular states \tp) and |</>). 
In other words, we require that 

U(\if>) ® |blank) ® |M )) = ® |^) ® |M^> , (4.6) 
U(\<f>) ® jblank) ® |Mo>) = \<f>) ® |0) ® |M^) . (4.7) 

Unitary transformations preserve inner products; if we take the inner product of these states 
before and after the unitary U we find the necessary condition: 

<^) 2 <M^|M )|<|(^) 2 |. (4.8) 

This can be true only when and \ <j>) are either equal or orthogonal to each other. This shows 
the impossibility of a universal, unitary cloning process that is perfect. The simple proof above 
also highlights the fact that orthogonal states can be cloned - which is not surprising, given our 
experience of the classical world and classical computation. 

There are other ways of showing that perfect cloning is impossible. One particularly nice 
proof consists of showing that a perfect cloning machine would enable faster-than-light signalling. 
To see why, let us start by providing two very distant partners, Alice and Bob, with a pair of 
qubits in a maximally entangled state: 

|V> + } = ^(|00) + |11)). (4.9) 

Alice would like to send a superluminal message to Bob. If she wants to send a zero, she 
projects her qubit along a certain direction thq in the Bloch sphere, otherwise she projects along 
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a second direction rft\. Alice's measurement effectively projects Bob's qubit along one of two 
possible directions, too or rn\. If Bob has a perfect cloning machine, he can make a large number 
of copies of his state and do measurements on them to distinguish these two possibilities with 
high accuracy. This would enable him to learn which measurement Alice performed, resulting 
in a faster-than- light communication of one bit from Alice to Bob. 

If the cloning procedure is just a little imperfect, still it is intuitively reasonable to expect that 
for high enough 'copy quality', one would still be able to signal faster than light. This simple 



idea led Gisin [87] to calculate the maximum cloning fidelity which still avoids super luminal 
communication. Interestingly, he found that the limits imposed by the no-signalling condition 
are the same as those imposed by the mathematical structure of quantum mechanics, which we 
will investigate in the next section. 

Next I will present the main features of the known approximate quantum cloning procedures. 

4.3.2 Quantum cloning machines 

We have just seen that perfect copying of an unknown quantum state is impossible. In this 
section we will have a look at some of the known cloning procedures, developed to allow optimal 
distribution of quantum information. 

There are two main types of cloning procedures. The first approach consists of using an- 
cillary quantum systems and a global unitary operation to obtain multiple imperfect clones of 
a given, unknown quantum state. These universal quantum cloning machines (UQCM's) were 



first proposed by Buzek and Hillery [40 1 , and are discussed in section 4.3.3 . 

A second approach combines a unitary evolution step with a measurement on a subsystem, 
which may sometimes collapse the rest of the system into perfect copies of the original state. This 
does not violate the no-cloning theorem, as it only works with probability p < 1. Moreover, we 
will see that for it to work we need to be sure that the original qubit is in one out of a known set 
of linearly independent states; thus, it is not universal, unlike the UQCM's. These probabilistic 



cloning machines were first analysed by Duan and Guo |[T1|, and are described in section 4.3.5. 



4.3.3 Universal quantum cloning machines 

In this section we will have a look at the universal quantum cloning machines (UQCM's), 
introduced by Buzek and Hillery in 1996 |4(J and developed by other authors [88, |208| , |119| , |34| , 

For simplicity, let us start by discussing the original UQCM that makes two clones of a single 
qubit state \ifii n ) [40]. The UQCM consists of a two-step procedure. Firstly, a 4-dimensional 



ancilla quantum system in a known state [blank) is coupled to the state \ipin) to be cloned. Then, 
a carefully chosen unitary U is performed over the combined 8-dimensional system, resulting in 
a final output state |Vw) containing the two clones: 

U{\ijj in ) ® |blank)) = |Vw) • (4.10) 

The final state \ipout) 'contains' the clones in the following sense: each of the first 2 qubit's 
reduced density matrices is in the state 

Pout = V \l/Jin) (V'ml + (1 -V)^' ( 4 - U ) 

where n is a yet undefined parameter which indicates the 'quality' of the clones, i.e. how closely 
they resemble the original state. The third qubit is not a clone; it is present to make possible 
the unitary that creates the two clones. 



If it were possible to find U such that rj = 1 in eq. (4.11), then we would have a perfect 
cloning machine. An optimal UQCM can be found by obtaining a unitary U that maximises rj 
and the fidelity of the clones with respect to the original state: 

F = (lpin\Pout\^in) = V + (1 - = J + I" ( 412 ) 
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The fidelity, as defined above, gives the probability that the clones will pass a test designed to 
check whether they are equal to \ipi n )- 

The development of the theory of cloning machines started with Buzek and Hillery , who 
showed that there exists a UQCM that makes two clones of a single qubit with F = |. Gisin and 
Massar |88| and BruB et al. |32| proved that this fidelity could not be surpassed, and therefore 
that this cloning machine was optimal. In addition, Gisin and Massar obtained fidelities for a 
family of N — > M UQCM's for a qubit, that is, cloning machines that have as an input N copies 
of the original qubit, and which output M clones. Brufi, Macchiavello and Ekert |33| proved 
that these UQCM's were actually optimal. The optimal clone fidelity for N — > M UQCM's for 
qubits was found to be: 



F 



NM + N + M 
M(N + 2) 



were 



Cloning machines for d-dimensional systems (d > 2) 
[ 208| , 119]. They showed that the optimal clone fidelity for a N 
systems is: 



clone 



M -N + N(M + d) 
M(N + d) ' 



(4.13) 

studied by Werner and Keyl 
• M machine for (i-dimensional 



(4.14) 



This last equation summarises and extends the previous results on optimal clone fidelities. 
Other work on cloning includes various analyses of quantum networks for implementing 

rig, 



cloning machines |5_ 
Gisin and Massar 



39, 



41 1 and some experimental realizations of cloning [176, p3 



state 



make an interesting connection between cloning machines am 
estimation, which will be important for the application to quantum computing we will develop 
later. 

State estimation is a very important problem in quantum information theory, and concerns 
the strategies that must be used to obtain maximum information about given properties of 
quantum states, through measurements on them. The simplest quantum state estimation sce- 
nario involves just a single qubit in an unknown state \ijjin)- Our task is then to make some 
measurement on the qubit, in order to obtain classical information about its state. This classical 
information must enable us to prepare a guess state \tpg U ess), which is to be 'as close as possible' 
to the original state \ipi n ). The distance measure we will use is again the fidelity of the guess 
with respect to the original: 



F 



guess/ 



(4.15) 



Massar and Popescu [143] found the optimal fidelity obtainable when we do state estimation 
in the above sense, using N qubits prepared identically in the same state \ip gU ess)- They obtained 



F 



N + l 
N + 2' 



(4.16) 



They noticed that eq. ( 4.16 ) is the li mit, as M — > oo, of the expression for the fidelity of the 
optimal N — > M cloning machine, eq. ( [4.13 ). As they observed (and was further discussed in 
]p3| ), the two processes are equivalent. To see this, it is enough to note that an infinite number 
of clones with fidelity F provides us with means to accurately describe the state of each clone 
(by characterising the states through measurements on the infinite ensemble of clones). This, 
in itself, provides a recipe to create a mixed-state approximation with fidelity F for the state 
estimation problem. 

It is important to note that the state estimation fidelity for a qubit [eq. (4-16)] is always 
smaller than the cloning fidelity ( 4.13| ). This reflects a fundamental characteristic of quantum 
information: the measurement process only reveals part of the information content of a single 
quantum state. By cloning a state, instead of measuring it, we choose to preserve as much as 
possible the quantum information encoded in it, which can be useful in subsequent quantum 
information processing tasks. This is the main idea behind the use of cloning in quantum 
computation tasks, an issue I will discuss in chapter |6[ 
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4.3.4 A curious fidelity balance result 

Before we resume our review of quantum cloning processes, I would like to take a short detour 
and discuss a curious fidelity balance result I found for general, universal quantum cloning 
machines. Let us start by having a closer look at the simplest UQCM, the 1 — > 2 machine for a 
single qubit. The original qubit state can be said to contain maximal quantum information, and 
by cloning it we will be distributing this quantum information unitarily among the two copies. 
Is there a way to characterise quantitatively this quantum information distribution? 

First, we need to distinguish between the merely classical and the genuinely quantum in- 
formation content present in the qubit state. There is no unique way of doing this; here I will 
proceed in a intuitively reasonable way, which reveals a curious fidelity balance result for optimal 
UQCM's. 

There is only a limited amount of classical information that can be gathered from a single 
qubit state through measurement. One way to quantify this classical information content is 
by using the fidelity of the state an optimal measurement procedure enables us to create, with 
respect to the original state. As we have seen in the previous section, this fidelity is simply the 
optimal estimation fidelity, and for a single qubit is equal to 2/3 [from eq. ( 4.16j )]. Let us use 



this optimal estimation fidelity as our measure of classical information present in a given pure 
state, before cloning occurs: 

jbefore = 3/3 

The original state, however, also contains quantum information which is not available through 
the measurement process. A measure of this quantum information content is given simply by 
the original fidelity (= 1) minus this classical estimation fidelity (= 2/3). We thus define the 
quantum information content of the original state as the difference between these two fidelities: 

jbefore = !_ 2 /3 = 1/3. 



After the cloning process, we obtain two imperfect clones, each with fidelity 5/6 [see eq.(4.13)]. 
The classical information content is still the same; one way to see that is to note that the cloning 
process is unitary, and therefore can be reversed, resulting in the original state which we can 
then optimally estimate. Therefore: 

jafter jbefore ^ /g 

The equality of this classical information content is just reflecting the fact that a unitary on 
the original state (even with ancillas) preserves the original classical information, as available 
by optimal state estimation. 

What is the quantum information content now? Each clone still has a higher fidelity than 
is possible to attain by classical information extraction through measurement from the original 
state. The difference between this fidelity and the optimal state estimation fidelity will be the 
intrinsically quantum information content of each clone. Since we have two clones, we obtain: 

= 2(5/6 - 2/3) = 1/3. 

We see that the 'quantum information content' (as I defined above) is conserved in this simple 
1^2 qubit UQCM. 

This might be seen as a numerical coincidence, as I have not presented an argument for why 
this should hold in general. It can be checked, however, that this conservation between classical 
and quantum fidelities also holds in the gene ral ca se of symmetric N — > M cloning machines in 
any dimension, as can be verified using eq. ( [4.14 ). Intuitively, what we see is that the original 



quantum information content (which cannot be obtained through measurement) is conserved in 
the unitary evolution characterising the cloning process, being equally distributed among the 
clones. 

This is a curious result. The way I defined the quantum information content was somewhat 
arbitrary, but still it is intriguing that it should be conserved during cloning, and decreased 
if we do any non-unitary (i.e. measurement) operation on the system. Unfortunately, the I q 
quantity I defined above is not conserved in asymmetric cloning procedures. Still, this type of 
approach may prove fruitful in quantifying the genuinely quantum information content of states 
undergoing different types of operations. 
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4.3.5 Probabilistic cloning 



In section 4.3.1 we saw that there exists no unitary operation which deterministically obtains two 
perfect copies of an unknown state (the no-cloning theorem). This, however, does not preclude 
the existence of a probabilistic non-unitary process which succeeds only part of the time (that 
is, with probability p < 1). These probabilistic cloning machines were discovered by Duan and 



Guo in 1998 [71|. In this section I describe their main results. 

The basic idea can be understood with the following simple example. Suppose we are given 
qubit A, which is known to be in one out of two non-orthogonal states: either |^q) or As 
with the simplest UQCM, we start by adding an ancilla system, consisting of two qubits (which 
we denote by B and C). Qubit B starts in a blank state; if the cloning is successful, at the end 
it will be in the same state as the undisturbed qubit A. Qubit C will be used as a probe to be 
measured at the end. Its role is two-fold: first, it enables us to collapse the state of the other 
two qubits into two perfect copies of the original state; besides, the measurement outcome will 
inform us whether the cloning process was successful or not. 

For this scheme to work, we must find a unitary U over the three qubits, which has the 
following action: 



u (\^o) A |blank) B \p ) c ) = a 00 \ip ) A \ip ) B \p ) c + a 01 \<p ) AB \pi) c > ( 4 - 17 ) 
U(\ipi) A (blank) B \p ) c ) = oio \ipi) A I^i)b \po)c + fl n ab \pi)c • ( 4 - 18 ) 

If we can perform the above unitary and measure qubit C, whenever we obtain the outcome 
\po)c, we will have succeeded in collapsing qubits A and B into two copies of the original state 
of qubit A. The question is, then, to build the unitary U and to obtain the optimal probability 
of success, which will depend on the coefficients aoo and aio- 

In ref. [71] Duan and Guo showed that it is possible to build such a unitary to clone a 
set of non-orthogonal states if and only if the states are linearly independent. Their proof was 
constructive, and enables us to find the optimal cloning probability. The proofs are lengthy and 



can be found in 71]; here I will just reproduce the results which will be useful for the discussion 



of the uses of cloning in quantum computing. 



Theorem 4.1 (Duan and Guo [|7l| ) A set S of pure quantum states can be probabilistically 
cloned if and only if the states in S are linearly independent. 



This theorem shows that, unlike the UQCM's of section 4.3.3, the probabilistic cloning 
machines are state-dependent and only work for states from a previously defined set of linearly- 
independent states. 

Let the above-mentioned set S consist of the n linearly independent pure states S = 
{■01, ip2, • • • ,ipn}- Let us define the n x n matrices 

X$$ = M1>i), (4-19) 

*$ = Wi) 2 . (4-20) 

Now we also define a diagonal efficiency matrix T as 

r = diag (71,72, ...,7n) • (4.21) 

The coefficients 71,72, •••,7n appearing in the definition above are the efficiencies (i.e. prob- 
ability of success) associated with cloning each of the states ipi, ip2, VVi- From the definition 

for the matrix T it follows that y/T = \/T + = diag (1/7I, • • • , \/ln) ■ 

All these definitions are necessary to state the theorem which gives a necessary and sufficient 
condition for probabilistically cloning a set S of linearly independent pure states: 



Theorem 4.2 (Duan and Guo [ 71 1 ) The linearly-independent states in S can be probabilis- 
tically cloned with a diagonal efficiency matrix V if and only if the matrix T = — 
VTXp^VT + is positive semidefinite. 
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The semi-positivity condition on matrix T results in a series of inequalities involving the 
efficiencies Solving these inequalities gives us the achievable cloning probabilities. 

Suppose we take one of the linearly-independent states in S and manage to probabilistically 
clone it into an arbitrarily large number of copies. By performing optimal state estimation on 
them, we can perfectly identify the original state. This connection between cloning and state 
estimation enabled Duan and Guo to establish the following theorem: 



Theorem 4.3 (Duan and Guo [71|) The linearly-independent states in S can be unambigu- 
ously identified with probabilities given by the matrix T if and only if the matrix X^ 1 ' — T 
is positive semidefinite. 

Again, the semi-positivity of matrix — T results in inequalities expressing the achievable 
state identification probabilities. 



Chapter 5 

Tripartite entanglement and relative 
entropy 



In chapter |3| we have reviewed some approaches to the problem of quantifying pure-state en- 
tanglement. We have also seen that the problem of quantification of entanglement for three or 
more parties is much harder than the bipartite case, in part because many of the theorems that 
apply to the bipartite case (such as the existence of a Schmidt decomposition [151, section 2.5]) 
do not carry over to many parties. 

In this chapter we will investigate pure state tripartite entanglement in the asymptotic 
limit of manipulations of a large number of copies of states. More specifically, we will try 
to characterise a Minimal Entanglement Generating Set (MREGS) for tripartite entanglement. 
Basically, we would like to find a set of tripartite states that play the role of maximally entangled 
EPR pairs in the bipartite case, i.e. from which any pure tripartite state can be obtained through 
reversible local operations with classical communication (LOCC) in the asymptotic regime. 

We start in section 5.1 with an introduction to this problem, followed in section [5^ by a 
review of some previous results by other authors. We will see that there exist relations between 
entanglement measures that need to be obeyed by tripartite states which can be reversibly 
obtainable from EPR pairs and GHZ states. These conditions establish relations between the 
problem of additivity for the relative entropy of entanglement, and the problem of finding an 
MREGS for tripartite entanglement. The first problem is important because it relates to an 
unsuspected mathematical property of one of the few entanglement measures that can be readily 
generalised for multi-particle states. The second, apparently unrelated problem will be shown 
to lead to the identification of tripartite states with interesting entanglement structures, which 
may be useful in n ew quantum information processing protocols. 

In section ^3 I present the results of numerical investigations I did, identifying families 
of states which either violate additivity of relative entropy, or which fail to be asymptotically 
equivalent to collections of GHZ and EPR states. When this work was finished, there were 
no examples of states with either of these interesting properties. Sub sequ ently, other authors 
managed to prove sub-additivity of the relative entropy for some states [ [203j , and also that GHZ 
and EPR pairs do not form an MREGS for tripartite states ||. This last paper in particular 
succeeded by further investigating a family of states we pointed out in the published version of 
this chapter ^4|, which I review in section |TJ|. This shows my approach here seems to have been 
fruitful, as it drew attention to the relation between these two problems and spurned further 
research into the entanglement structure of some families of tripartite states. 

By numerically calculating the relative entropy, in section 5.5 1 show that there is a tripartite 
state which is more entangled than the three-qubit state usually referred to as 'maximally 
entangled', the GHZ state. In section 5.6 I close the chapter with a summary of the main 
results. 
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5.1 MREGS for multipartite entanglement 

In this section we address the problem of asymptotically reversible entanglement transformations 
for multipartite entanglement. In section [O] we reviewed the theoretical framework proposed 
in fj"5|| , defining the notion of a Minimal Reversible Entanglement Generating Set (MREGS) for 
creating multipartite pure entangled states. In the bipartite case we have seen that any pure 
entangled state can be reversibly created with LOCC only on copies of a maximally entangled 
pair of qubits, in the asymptotic limit of manipulations of many copies. This means a single 
EPR pair of qubits qualifies as an MREGS for bipartite pure-state entanglement. 

The question we will investigate here is whether it is possible to define an MREGS for 
multipartite entanglement generation. In particular, we will address the problem of tripartite 
entanglement. 

Before I describe my work on this subject, it is convenient to review some results by other 
researchers. A key motivation for my work was the proof that generalised GHZ states alone do 
not constitute an MREGS for four-particle entanglement. This was obtained by Wu and Zhang 
J215| ], and can be more precisely stated as follows. 

Suppose that, in analogy with the bipartite case, we can create any four-partite entangled 
state using reversible LOCC on a large number of states of only a few types, each of which is 
represented in an MREGS for 4-partite states, which we will call G4. From our experience with 
the bipartite case, it is natural to conjecture that G4 may consist of: 

G 4 = { \^ + ) AB , \^ + ) AC > \^ + )ad > \^ + )bc > \^ + )bd > \^ + )cd > 

I ghz )abc ' I ghz )abd > I ghz )acd ' I ghz )bcd j 

\Gm A BCD }■ (5-1) 

In the set above we included all maximally entangled EPR pairs between all pairs of parties, 
plus GHZ states 

|GHZ) 4jfc = l/v^GOiOjOfc) + I 
of all possible combinations of three parties i,j and k, plus the GHZ state of all four parties 



\GRZ) ABCD = 1/V2 (\0 A 0b0 c 0d) + \1 A 1b1 C 1d)) • (5.2) 

The conjecture here is that from the states in G4, it is possible to generate any pure entangled 
state of four parties, reversibly in the asymptotic limit. 



In [215] Wu and Zhang showed that this conjecture is false, i.e. set G4 is not an MREGS 
for pure states of four parties. Their argument relied on calculating some combinations of sums 
of von Neumann entropies of arbitrary tensor products of states in G4 for all possible cuts into 
two parties, and showing these were inconsistent with any possible combination of states in G4 
for the particular state 

j^) = I (|0000) + |0110) + |1001) - |1111» • (5.3) 

Besides the fundamental importance of the result, this approach is also promising because it 
identifies states with interesting entanglement structures, which can be useful for new quantum 
information appl icati ons. For example, state \r]) above has appeared in the work of Briegel and 



Raussendorf 24, 163], who proposed a quantum computation model based solely on a sequence 



of single-qubit measurements on carefully designed, highly entangled cluster states of many 



qubits. State \rj) above was pointed out in [24] as a simple example of such a cluster state of 



four qubits. The same state reappeared in [jj-95|, this time as a representative of the generic 



class of four-qubit entangled states under single-copy LOCC manipulations. 

The result of Wu and Zhang prompts the question about the structure for the MREGS for 



tripartite states. Linden et al. addressed the tripartite case in [137], which actually appeared 
before the results of Wu and Zhang. They showed that the three-party GHZ state and col- 
lections of two-party EPR states between the three pairs of parties are indeed asymptotically 
inequivalent. This means an MREGS for tripartite entanglement cannot consist only of EPR 
states between all pairs of parties. 



In the next section we review the results of Linden et o/.'s paper |137|, so that I can state 
my main results on this problem. 
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5.2 Tripartite entanglement and relative entropy 

Given the asymptotic inequivalence of 3-party GHZ states and pairs of EPR states, we can state 
a conjecture: 

Conjecture : Set G 3 = {\^P + ) AB , \ip + ) Ac , \^ + ) BC , \GHZ) ABC } 

is an MREGS for tripartite pure states. (5-4) 



Some evidence supporting this conjecture was provided in [ 200 1 , which de scri bes a class of G3- 



equivalent states (for the definitions of asymptotic equi valen ce see section ^ 



Regarding this conjecture, Linden et al. showed in [ 137 1 that any G^-equivalent state must 
obey certain relations between two different entanglement measures: the von Neumann entropy S 
of each party's subsystem, and the regularised relative entropy of entanglement E reg of two-party 



reduced density matrices (see section 3.5.2 for the relevant definitions). Since those relations 
will be important for the work I present in this chapter, let us see how we can establish them. 

Let us start by remembering that a tripartite pure state \<f>) ABC is said to be G3-equivalent 
if there exist LOCC operations that convert \<t>) abc arbitrarily good approximations of 
tensor products of the states in G3, reversibly in the asymptotic limit of many copies. These 
asymptotically reversible transformations can be represented as: 

Tbc L °^ C k + )AB AB ® Wag" ® \^)%c BC ® IGHZffi (5-5) 

where Nsab, Nsab, Nsab and Ng represent the number of copies of each state in G3 necessary 
for the rev ersible creation of ./V copies of \4>) A BCi as ^ ~~ * 00 • 



In [ 137 ] Linden et al. showed that the regularised version of the relative entropy of entangle- 
ment E reg for any two-party cut of a multipartite state must be invariant under asymptotically 
reversible LOCC. It is also known that the von Neumann entropy must also remain the same 
duri ng asymptotically reversible LOCC transformations. In the case of £r3-equivalent states 
( |5.5|) , this means that the two-party E Teg and von Neumann entropies must be the same on the 



two sides of equation (5.5) 



To see more explicitly the consequences of this resul t, le t us first calculate the reduced density 
matrix oab of the state on the right-hand side of eq. (|5.5|). It can be readily shown to be 



oab = iv + > <v> + r As ® (ii°)(°u + !ii>uur Ac 

® G|0)(0| B + l\l)(l\ B f SBC (||00)(00Ufl + ±\ll)(U\ABf 9 ■ (5.6) 

Since the von Neumann entropy is additive for tensor products, it is easy to calculate it for gab 
above, and also for the other similar two-party reduced density matrices. We obtain 

S{p AB ) = S{o c ) = sac + sbc + 9, (5-7) 
S(a A c) = S(a B ) = sab + sbc + 9, (5.8) 
S(a B c) = S(a A ) = sab + sac + 9- (5.9) 

Now let us calculate E re9 for the same two-party density matrices of the right-hand side of 
eq. (|5.5|). For that we need two results: for pure states E is equal to the von Neumann entropy, 



and therefore i s ad d itive in this case [192]; and E{gab ® Psep) = E{gab) for separable p sep and 



arbitrary gab [161, 162]. Using these two facts we can calculate E reg for the two-party density 



matrices ctab^ac and obc'- 



E Te9 {a AB ) = E{o A b) = sab, (5.10) 
E re9 {o AC ) = E(a A c) = SAC, (5.11) 
E re9 {a B c) = E{obc) = sbc. (5.12) 
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Be caus e of the invariance of S and E reg under asym ptot ically reversible LOCC, equations 
( 5.12D must hold also for the left-hand side of eq. (|5.5| ), i.e. for any G^-equivalent state. 



These equations can be succinctly summarised as: 

E reg ( aij ) = 8ij (5.13) 

S(a A ) = g + sab + sac (5.14) 

S((Jb) = g + sab + s B c (5.15) 

S(<Jc) = g + sac + sbc, (5.16) 

where E reg {aij) is the regularised relative entropy of entanglement of the reduced density matrix 
Uij of parties i, j. 

Let us now see how we can use the relations established above to make a connection be- 
tween the problem of G^-equivalence and the question of additivity of the relative entropy of 
entanglement. 

5.2.1 Sub-additivity of E and tripartite MREGS 



Suppose we find a state incompatible with relations ( 5.13| )- (|5.16] ). Because of the results reviewed 



above, this state would be asymptotically irreducible to the set G3, and therefore would represent 
asymptotic tripartite entanglement of a type genuinely different from that of a GHZ state. 
Unfortunately, only very recently were methods to compute E reg developed |7]], and they only 
work for a restricted class of states, and for a slight modification of the relative entropy we will 
discuss later. 

Despite this, I managed to make some progress in establishing relations between additivity 
questions and the structure of a possible MREGS for tripartite pure states. My approach was 
to search for states that fail to obey the following relations: 

E{a, l3 ) = Sij (5.17) 

S{a A ) = g + sab + sac (5.18) 

S(ctb) = g + sab + sbc (5.19) 

S(a c ) = g + s AC + sbc, (5.20) 



which are identical to the relations ( |5.13| )-( 5.16| ), except that here we have the single-copy 



relative entropy of entanglement E, instead of its regularised version. At the time this work 
was completed, it was c o nject ured that E was fully additiv e. If t hat w ere the case, any state 
violating relations ( [5.17|)- ( |5.20| ) would automatically violate ( j5.13|) -( |5.16| ) as well, as in that case 
E = E reg (see footnotef]). 

Through numerical c alcula ti ons a nd analytical work I identified families of states which fail 
to comply with relations ( 5.17|) -( |5.20 ), and are likely candidates for G3-irreducibility. These will 



be presented in section |5.4[ 

So what does it mean to show that there are states which are incompatible with relations 
(|5l7|)-(|5T2C|)? Basically, it means that from the two statements below, at least one must be 



false: 

1- all tripartite pure states of three qubits (an in particular the states I discuss) are asymptot- 

ically reducible to set G3, i.e. G3 is an MREGS for asymptotic tripartite entanglement; 

2- the relative entropy of entanglement is generally additive, or at least it is for the states I 



present in section 5.4 



lr The original paper |137[ stated relations ( 5. 17 )-( 5.2C ) instead of the correct ( 5. 13 )-(p. 16|) . This relied on the 



implicit assumption that E(aij) is asymptotically additive, which was subsequently shown to be false in |203 
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When most of the work reported in this chapter was published [84], it drew attention to 
the relation between these two problems, which were then open. In relation to statement 2, 
a definitive proof that the relative entropy of entanglement is sub-a dditi ve for some 3x3- 
dimensional stat es w a s sub sequently found by Vollbrecht and Werner 203 1 . Despite extensive 
numerical work [111, 192], the additivity question is still op en for two-qubit states, and in 



particular for all such states studied numerically in section 5.4 below. 

This suggests s tatem ent 1 above is likely to be false. Indeed, following our suggestion 
presented in section 5.4.3| (and published in [Q ) , Aci' n, Vid a l and Cirac showed that purifications 
of a certain bound-entangled state violate relations ( 5.13j )-( |5~16| ) and are not G3-equivalent ||. 
This resulted in a definitive refutati on of c onjec ture ( |5.4| ), without the as sum pt ion o f additivity 
that makes E reg = E and relations (|5.17[)-(^20|) equivalent to relations (|5.13|)-(|5~1~6|). 



5.3 Calculating the relative entropy of entanglement 

In this section I will describe how we can compute the relative entropy E numerically for general 
states. With E of the two-party reduced density matrices Gij and the von Neumann entropies 
S(<jj) we can check whether relations ( |5.17| )-( |5.20| ) hold or not. Using this method, in the next 
section I will explicitly list families of states which do not obey those relations. As discussed 
above, this has implications for the question of additivity of E and also for the possible structure 
of an MREGS for tripartite pure states. 

But how do we go about numerically calculating El This problem was discussed in [192], 



and here we describe some of the properties of E which allows us to do this for small-dimensional 



systems. In section 3.1 we have seen that a general separable density matrix can be written as 

K K 



Psep 



^Pi\<j)i){<l)i\ A \(t>i){(l>i\ B ---\<j)i){<t>i\ N , ^2pi = l, (5.21) 



where \4>i)j represents a pure state belonging to party j. The first question we can ask is how 
many terms K this expression needs to have. If K cannot be upper-bounded or is too high, no 
numerical procedure can be efficiently implemented to compute E. Fortunately, for two-qubit 



systems it is known [211, 170] that K < 4. More generally, it has been shown [|110| ] that for 
systems of dimensionality d,A X &b we need at most K < (dAcls) 2 terms. 

Let us now consider how to determine the relative entropy E(gab), where gab is a general 
two-qubit density matrix. The results cited above show that any two-qubit separable state pTo 
can be parametrised by a convex combination of at most four separable states: 

4 4 

p s ab = ^2pi l&X&L I&X&Ib ' ^2pi = 1 - ( 5 - 22 ) 

i=l i=l 

Each two-qubit product state in the equation above can be parametrised by 2 x 2 = 4 real 
parameters, and because YliPi = 1> the total number of real parameters in the parametrization 
for pY B is 3 + 4 x 4 = 19. 

To calculate the relative entropy of entanglement we need to do a search over this 19- 
parameter space to find the separable state pab which minimises the following function: 

E{gab) = min D(a AB \\pA B ) (5.23) 
Pab 

= Tr {a A B log gab) ~ min Tr (g A b log pab) (5.24) 

Pab 

= -S(g A b) ~ miiiTr (gab log pab) (5.25) 
Pab 

For a given p^g, the evaluation of the function above involves the diagonalisation of Pa B i 

taking the logarithms of its eigenvalues, and calculating Tr (gab log Pab) m ^ ne basis which 
diagonalises Pab- 
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Finding the global minimum of a function denned in such a large parameter space can be 



very hard. Fortunately, the relative entropy has the useful property of convexity |3y, 7C]. This, 
together with the convexity of the set of separable states, guarantees that any local minimum of 
the relative entropy is also a global minimum. This makes a simple gradient search algorithm 
efficient to find the minimum for the function above, and calculate the relative entropy of 
entanglement. We start with a given separable state and calculate the approximate gradient 
of the relative entropy, for small changes in all directions in the parameter space. With this 
approximation of the gradient we can move towards a new separable state which is strictly closer 
to the absolute minimum, and start the process again. After a few thousand iterations we reach 
the local minimum, which will also be the global minimum because of convexity, as we have just 
mentioned. 

For the gradient search to work it is important to choose a parametrization that does not 
have discontinuities for close separable states. In my program I used the suitable parametrization 



presented in 192 



5.4 Suspicious states 



In this section I present tripartite states I found to be incompatible with relations ( 5.17| )- fl5.20 ) 
using the numerical methods described in the last section. These include the class of states 
known as VF-type, and also a family which I called A states. 

I also provide an analytical argument identifying a large class of states as likely candidates 
to violate relations ( |5.17 )-( 5T20| ) and also ( 5.13 )-( |5T6 ), and thus contradict conjecture (5.4). 
These are the purifications of bound-entangled states, and they were in fact proven to be G3- 
inequivalent in || , following publication of the argument presented in section |5.4.3 in Q| . 



5.4.1 A states 

These are states of the form: 



\A(a,b)) = a|000) +6|100) + 6|101) + 6|110) +6|111> 



(5.26) 



An interesting property of states \A(a,b)) is that obc is separable for all a , b. This follows 
from the fact that o~bc is positive under partial trans positi o n (see section |3.1.1| ). 

Numerical tests show this family violates relations ( 5.17 )-( j5.20|) . For example, when a = b = 
1 / y/b we have 



0~AB = 0~ AC 
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(5.27) 



(5.28) 



If state |A(l/\/5, l/v^)} obeys equations (gl7| ) - (|^2C| ), we would need to have E{gac) equal 
to S(ua) — S{ub) — 0.1541. My numerical calculation found E(u ac) — 0-197 1, which is over 
25% higher than the required 0.154 1 for agree ment with equations fl5.17j )-( |5.20| ). 

These states violate relations ( pM7 )-( |5.20D . Now let us assume that E is asymptotically 
additive for all two-qubit states. From our previous discussion it follows that they would also 
violate ( |5.13| )-( |5TT6| ) and would be G3-inequivalent. This would bring interesting consequences 
for the minimal cardinality for a tripartite entanglement MREGS. 

An MREGS for tripartite entanglement must contain at least the states in G3. The GHZ 
state is necessary to asymptotically generate states which are GHZ-equivalent: all Schmidt- 
decomposable states [16]. The three EPR pairs are needed to asymptotically generate all states 
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with bipartite entanglement only. If the |A(a, b)) states ( 5, 26] ) are indeed GVinequivalent, then 



at least one of them would have to be added to set G3 if we are trying to form an MREGS. 
Let us add one such state to set G3, and see whether the new set can be an MREGS. Our new 
candidate for an MREGS is now: 

G A = G 3 U{|A(1/^5,1/V5))}. (5.29) 

Consider now the states obtainable from |A(l/\/5, l/\/5)} by swapping parties A and B. By 
the same argument given in this section, this new state would also be GVinequivalent. Note 
that now the separable two-party reduced density matrix i s a AC -, i.e. E re9 (aAc) = 0. Because it 
has separable ctac, it is also GA-inequivalent, as relations ( |5.13| )-( |5~T6| ) forbid it to be equivalent 
to any tensor product of states with non-separable oac- 

This argument shows that if |A(a, b)) is indeed G3-inequivalent, then we would need to 
expand the candidate for an MREGS G3 by adding three more states, corresponding to one 
member of the family |A(a, b)) and the two other states obtainable from it by swapping either 
A and B, or A and C. This would mean that the cardinality of a tripartite MREGS would have 
to be at least seven - it would have to contain the GHZ, three EPR pairs, one \A(a,b)} state 
and the two |A(a, 6))-type states obtained from |A(a, &)) through swapping two parties. 



5.4.2 W-type states 

Another class of states violating relations ( 5.17|) -( p.20D are the so-called W states, which have 
already appeared when we discussed single-copy entanglement manipulations in section [3.3| . 
The class of states we will consider are of the form: 

\W(e, /)) = e|000) + /|101> + /|110), (5.30) 

where e and / are real parameters. The two-party reduced density matrices are given by: 



<JAB(e 2 J 2 ) = a A c(e 2 J 2 



<JBc(e 2 J 2 
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(5.31) 



(5.32) 



Let us first check the predictions for E given by equations (|5.17|) -( ^20|) . For those relations to 
hold we would need 

E(a AB ) + S(a AB ) = E{o BC ) + S{a BC ). (5.33) 
The relative entropy of state a B c has been computed in example 1 of |192j1 : 

(5.34) 



E(a BC ) = 2(/ 2 - 1) log 2 (l - f 2 ) + (1 - 2/ 2 ) log 2 (l - 2/ 2 ) 

and by direct computation we find that 

S(a BC ) = "(1 - 2/ 2 ) log 2 (l - 2/ 2 ) - 2/ 2 log 2 (2/ 2 ), 
S{cjab) = ~f 2 log 2 f 2 - (1 - f 2 ) log 2 (l - f 2 ) . 



(5.35) 
(5.36) 



Combining these equations with eq. ( 5.33j ), we see that if E is asymptotically additive for cjab, 
then G3 can only be an MREGS for states of the form ( 5.3C| ) if 



E{a AB ) 



l)log 2 (l - f 2 ) - 2/ 2 log 2 (2/ 2 ) + / 2 log 2 (/ 2 



(5.37) 
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With e = y/2/3 and / = \/l/6 eq. ( |5.37 ) predicts E ~ 0.3167. A numerica l cal c ulatio n 



yields E ~ 0.3548, in disagreement with the prediction obtained from relations ( 5.17j )- (|5.20 ). 



Moreover, in jSJJ an analytical argument was found to corroborate this numerical result, in joint 
work with Martin Plenio and Shashank Virmani 0. 

5.4.3 Purifications of bound-entangled states 

There is a very wide class of states which are good candidates for not being (^-equivalent. These 
are simply all tripartite purifications of bipartite bound-entangled states (see section |3.1.1| ), Let 
me now present an argument for why this is the case. 

When we defined the relative entropy of entanglement Es(ctab) and its regularised version 



E T g 9 {(jAB) in section it was with respect to the set S of separable states. The definition 
makes sense because we wanted to quantify distinguishability of entangled states with respect 
to any state in that set, which is invariant under local unitary operations. We can, however, 
define the relative entropy of entanglement with respect to other local-unitary invariant sets, 
such as the set of PPT states, i.e. states which remain positive after partial transposition. 
This represents a wider class of states, which includes separable states as a subset, but also the 
non-separable PPT states known as bound-entangled states. Because the set of PPT states is 
easier to characterise mathematically than the set of separable states, some results have been 
obtained recently for the relative entropy with respect to PPT states (which we shall denote 
here by Eppt), for example calculations of its regularised version E r p e p T [0, ||. 

As we have seen in the previous sections, any (^-equivalent state \<p) ABC can be obtained 
from G3 through reversible LOCC in a process like: 

\*)abo L °^ C \^)ab B a \^)ac C ® l^fsT ® l GHZ >^c • (5-38) 



The relations (Pl) - (P^) were derived in |137[| by proving that the relative entropy E T g 9 of 



two-party subsystems in such asymptotically reversible transformations must remain constant. 
Their proof relies on general properties of the relative entropy, and also goes through for the 
alternative definition E r p e p T with respect to PPT states. 

The two-party reduced density matrices of the right- hand -side of eq. (|5.3Sj ) above are 
tensor products of pure states with separable states [see eq. (|5,6j) l . For such states it has been 



show n that E r g 9 (oij) = E T p e p T (aij), being equal to the von Neumann entropy S(aij) [192, 161 , 



189]. Because E re9 (aij) is conserved in reversible transformatio ns, th is must be true also of the 
two-party reduced density matrices of the left-hand side of eq. (|5.38j ), i.e. of any (^-equivalent 
state. 

Now consider a tripartite state \(/))abc whose reduced density matrix oab is bound entangled, 
i.e. E T pp T (aAB) = but Es(oab) 7^ (because it is non-separable). Despite the fact that 
Esi&AB) 7^ 0, to be G^-equivalent \4>)abc needs to have E r g 9 (uAB) = 0, so that E T s eg '((tab) = 
E r pp T [pAB) = as required. This is a very peculiar requirement: we need the non-zero Es(<jab) 
to vanish in the asymptotic limit of many copies. Note here that for G3 to be an MREGS, this 
requirement would need to hold for arbitrary tripartite purifications of arbitrary bound-entangled 
states. 

This shows that purifications of arbitrary bipartite bound-entangled states are candidates 
for non-G3 reducibility. Subsequently to the published version of this argument |84[], Acin, Vidal 
and Cirac proved that indeed G3 is not an MREGS for tripartite states by showing that E r $ 9 7^ 
for a particular bipartite bound-entangled state ||. 

2 I decided to omit this proof here since my collaborators' share in it was bigger than mine, and it involved 
more sophisticated mathematical arguments which would require extensive background material to be added to 
this thesis. 
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5.5 Entanglement of W versus GHZ 

It is easy to modify the program for calculating the relative entropy of entanglement E for two- 
qubit states and make it work for arbitrary pure-state systems. I changed the program slightly 
to investigate E for three-qubit states. In this section I will show that in a very operational 
sense, the W state 

|W^) = -^ (|000) + |101) + |110)) (5.39) 

can be shown to be more entangled than the three-qubit GHZ state, contrary to the usual claims 
that the GHZ stat e is t he 'maximally entangled state' of three qubits. 

The W state ( |5.39D has many interesting properties. In [121, |74| it was shown that it is 



the three-qubit state that is maximally robust under disposal of one of the three qubits. More 
precisely, the two-party reduced density matrices of the W state maximise several measures of 



bipartite mixed-state entanglement. For example, the concurrence was defined by Wootters [211] 
as a measure of bipartite entanglement and can be calculated exactly for any two-qubit mixed 
state. It turns out that the W state maximises the average concurrence of the three two-party 



reduced density matrices [121, [74]]. This is in great contrast with the [GHZ) state, which has 
separable reduced density matrices. 

So we see that the bipartite entanglement present in the reduced density matrices is maximal 
for the W, and zero for the GHZ state of three qubits. This, however, may be an unfair way 
to measure the overall entanglement for tripartite states. We know the GHZ state is highly 
entangled, allowing proofs of non-locality without inequalities |j92| , |9l| , [145|| , and improving the 
efficiency of quantum information applications, an example of which we will see in chapter [?|. 
What we can safely say is that the types of entanglement present in the W and the GHZ states 
are different (see section |3~3| ) , and accordingly can be used for different tasks. 

This having been said, there is a very suggestive way of defining what a maximally entangled 
state should consist of. It should be the state which, upon measurements, is easiest to distinguish 
from any separable state. In other words, we can call a state "maximally entangled" when it is 
the state most different from any separable state, i.e. the state which displays entanglement in 



the most manifest way. As we have seen in section |3.5| , Sanov's theorem can be used to show 
that the relative entropy of entanglement is a good measure of distinguishability between a given 
entangled state and any separable state. It is therefore reasonable to call a state maximally 
entangled when it maximises the relative entropy of entanglement, with respect to the set of 
separable states. 

In order to compute E for three-qubit states, we need to parametrise all tri-separable states, 
i.e. states which can be expressed as 

K K 
Psep = ^2Pi\(i>i)A( ( i ) i\A\ ( t > i)B( < f ) i\B\ ( f ) i)c( ( f > i\c> = L ( 5 " 4 °) 



By the arguments of J192j ] it can be shown that an upper bound for the number of terms K is 
(cIa x ds x dc) 2 = 8 2 = 64. Including the probabilities for each term, the total number of real 
parameters describing a three-qubit separable state is given by 63 + 6 x 64 = 447. A gradient 
search over this large parameter space takes a long time but it is still feasible. 

While for the GHZ and simple EPR states E = 1 exactly, for the three-qubit W state ( |5.39| ) 
I numerically found E(W) = 21og(3) — 2 ~ 1.170, which is definitely higher than for the GHZ 
state. We thus see that when we use the relative entropy as a measure of entanglement, it is 
fair to claim that the W state is more entangled than the GHZ state. 

It would be interesting to do some more work on this problem, for example by explicitly 
describing the tests necessary to show this higher degree of entanglement. One interesting 



approach to this problem was put forward by Peres [157], who suggested that a Bayesian analysis 
of experiment outcomes can be used to quantify the 'strength' of locality violation. Basically, 
he uses a Bayesian approach to calculate how many experimental runs are necessary to convince 
a skeptical observer that locality cannot hold, up to a certain level of statistical confidence. 
It would be interesting to perform such an analysis for non-locality tests using W states (for 
example [fi3[]), and compare the results with experiments done with GHZ states. 
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5.6 Chapter summary 

In this chapter we have identified a number of tripartite states for which either the relative 
entropy is sub-additive, or which represent a type of genuine tripartite entanglement different 
from the GHZ type, in the asymptotic limit. Further work by other authors has proved that 
indeed some of those states are not G3 -equivalent. 

These states were singled out because of their interesting entanglement structure, which may 
be useful in new quantum information applications, as has already happened with states ( |5,39| ) 
and (5.3) above. The fact that purifications of bound-entangled states are not G^-equivalent 



suggests that multipartite entanglement generation may be genuinely irreversible, i.e. it may be 
impossible to find MREGS with finite cardinality for entanglement of more than two parties. 

In order to identify candidates for G3-inequivalence I resorted to a numerical calculation of 
the relative entropy of entanglement for two-qubit systems. By applying the same method for 



three-qubit states, I showed that the W state (5.39) is more entangled than the GHZ state, 
in the sense that its measurement statistics are more easily distinguishable from those of any 
separable state. 



Chapter 6 

Cloning and quantum computation 



The COPY operation in classical computing is very useful, as it allows one to make multiple 
copies of the output of some comput ation, that can be fed as the input to further multiple 
processes. As we have seen in section 4.3.2 , the quantum copying (i.e. cloning) is necessarily 
imperf ect, introducing some noise in the second round of computation. This situation is pictured 
in fig. P(a). 

The imperfect nature of the quantum cloning procedure results in lower chances of getting 
the correct computational outputs at the end. Nonetheless, in this chapter I will show that 
in some cases cloning improves our chances of correctly computing both branches, if there are 
constraints on the number of times we can run the first part Uq. I will do this by presenting 
two examples of computations for which cloning offers an advantage which cannot be matched 
by any approach that does not resort to quantum information distribution [as in fig. |6.1| (b)1. 

A reasonable question to ask is in which situations we can expect cloning to be helpful during 
a quantum computation. One might think that cloning could be helpful in state estimation. As 
we have seen in section 4.3.3, state estimation is equivalent to cloning, when the number of 

sr 



copies M — > oo [34, p^ J. In relation to this point, D'Ariano et al. |5^] have shown that cloning 
can be useful for obtaining a good estimation of non-commuting observables of a single qubit, 
by making a few clones and directly measuring different observables in each of them. However, 
optimal state estimation procedures for the same task will in general be different, and more 
efficient, than simply measuring the clones of a Universal Quantum Cloning Machine (UQCM). 

It seems that cloning for quantum computation is most efficient when used in the middle of a 
coherent quantum computation. This is the case because the full specification of an intermediary 
state of the quantum register typically involves a very large amount of classical information, 
which in general is not accessible through measurement. A compelling demonstration of this 
will be presented in chapter ^. By preserving the quantum information in a way classical 
measurement cannot do, cloning ends up offering an advantage over procedures which do not 
distribute quantum information. 

This use of quantum cloning represents and idea more in the spirit of fig. |6.1| (a). In this 
chapter I will give two examples of quantum computation tasks which ca n b e better performed 
if we use quantum cloning in this way. The task we analyse in section |6.1| uses the UQCM's 
presented in section 4.3.3, whereas in section 6.2 the task relies on the probabilistic cloning 
procedure we discussed in section 4.3.5. 



6.1 Example with universal cloning 

The first example I present is based on the scenario introduced in fig. |6.1| (a). It models the 
general situation in which we want to perform M different quantum computations, all of them 
with some first computational steps Uq in common. 

Suppose that we are constrained to run Uq only once. This may happen if Uq is a complex, 
lengthy computation. In this case, we will be forced to find a scheme that obtains the M 
computation results with as large a probability as possible, using Uq only once. Possible schemes 
may or may not resort to cloning to distribute quantum information; the example below is one 
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Figure 6.1: (a) This circuit represents the use of cloning to obtain information about computa- 
tions UiUo and U2U0 using Uo only once. (b)This is one of the no-cloning strategies available 
for the same task. There are other possibilities: for example, we could embed each of the Uj in 
a different quantum circuit, designed specifically for obtaining information about Uj. 



in which cloning enables us to improve our performance, in relation to any scheme in which 
there is no information distribution using cloning. 

In order to specify our task, suppose that we are given (M + 1) quantum black-boxes. 
What black-box j does is to accept one cZ-level quantum system as an input and apply a unitary 
operator Uj to it, producing the evolved state as the output. We may think of the black-boxes as 
quantum oracles, or quantum sub-computations. The Uj are chosen randomly from all possible 
U(d) unitaries, using the unique uniform distribution invariant under action of U(d) (see [)138| ). 

Our task will be to build quantum circuits that use each Uj at most once to create M mixed 
quantum states pj, each as close as possible to \<pj) = UjUo\0) , (j = 1, 2, ...M), where |0) is 
an arbitrary reference state. Our score will be given by the average fidelity of these imperfect 
approximations of the ideal |</>j)'s: 

1 M 

This choice of scoring function evaluates the 'quality' of the approximations we can make to 
each computation output, giving the same weight for each of the M computational branches. 



6.1.1 Score with cloning step 

Now let us see how cloning allows us to obtain a high score F. We accomplish this by using a 
quantum circuit that first applies Uo to the initial state |0), followed by an optimal universal 
cloning machine to obtain M imperfect copies p c of state Uo |0). We then apply each Uj (j = 
1, 2, ...M) to a clone, obtaining reduced density matrices 

Pj = v \0j}(0j\ + (l- v )±l, (6.2) 

where 77 is such as to give us the optimal cloning fidelity given by eq. ( |4. 14 ) with N = 1. Using 
the resulting pj's as our guesses for states \<pj) (j = 1, 2, ...M), we obtain an overall score 

2M + d - 1 

'cloning- M ^+l) ' { 1 

which is just the fidelity for the optimal 1 — ► M cloning machine [see eq. ( |4.14 )1. 
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In the next section we will evaluate the score obtainable without cloning, and we will see 
that it is always smaller than this. In fact, eq. represents the optimal score obtainable for 

this task, at least in the case M = 2. In order to see this, we first note that asymmetric cloning 
(arising when the factors rj are in general different for each copy) is of no help in raising the 
score. This can be deduced from [^] and f49f| , where the authors consider asymmetric cloning 
with M = 2 and show that the sum of the fidelities of the copies is maximised by symmetric 
cloning. Furthermore, the optimality of the universal cloning procedure we have used entails 
optimality for the fidelity of each of the pj, and therefore a maximal value of the score F. This 
shows that this task is optimally performed [with optimal score given by eq. ( |6.3] )1 if and only if 
we are allowed to use cloning. It is straightforward to generalise the result to the case where we 
are allowed to run Uo N times (JV < M), instead of just once, and quantum cloning still offers 
an advantage. 



6.1.2 Score without cloning step 

Now let us evaluate the attainable scores if we do not resort to cloning. If we are not allowed 
to clone the state, there are two simple strategies possible. 

The first strategy relies on obtaining maximal classical information about state Uo |0), and 
using it to run all M branches of the computation as well as possible. It starts by running Uo, 
followed by measurements that accomplish an optimal estimation of the resulting state Uo |0). 
After this, we can use the information gathered to build the M imperfect copies necessary to 
proceed to the second part of the computation with the Uj (j = 1, 2, ...M). As we have seen in 
section |4.3.3 , optimal state estimation yields 'classical clones' with fidelity given by eq. ( f4,14| ) 



with N = 1,M ->■ oo: 

* = (STTy (6 - 4) 

We obtain our approximations of states \4>j) (j = 1,2, ...,M) by applying each Uj (j = 
1, 2, ...M) to on e of t hese classical clones, resulting in a score also given by eq. ( |6.4[) . As we have 



seen in section 4.3.3 , this fidelity is always inferior to the symmetric cloning fidelity presented 
in the previous section. In other words, it is best to preserve the quantum information present 
in the intermediary state fTo 1 0) , distributing it coherently through the cloning process. 

Another strategy which avoids using cloning is to obtain as little classical information as 
possible about the intermediary state. This can be done by obtaining = UiUq\0) with 
fidelity one. Now that we have used Uo and U\ once already, we must make guesses about the 
other (M — 1) states \(pj) by using only the remaining (M — 1) black-boxes. As they were drawn 
from an uniform random distribution, the best we can do is to make random guesses (each on 
average with F = 1/d), obtaining, on average, a score 

1 'l + ^] . (6-5) 



M V d 

This score is also always inferior to that of the cloning strategy. 

The two strategies presented above are extremes of a continuous set of strategies, which 
rely on extracting some classical information from state Uo |0), using it to run M — 1 of the 
computational branches, while preserving part of the quantum information present in that same 
state, using it to run one of the branches. Intuitively, one would expect that extraction of 
classical information can only degrade the quality of the resulting clones, and the overall score. 
This is indeed true, as was confirmed by work on this fidelity-tradeoff balance problem, studied 
by Banaszek and Devetak fi 0]. Their work shows that in order to maximise the sum of the 
fidelities of the clones, it is necessary to avoid doing any measurement that extracts classical 
information from the state. As we have seen above, the symmetric cloning strategy is optimal 
for the task, beating any such combined strategy. 

With this we conclude the analysis of possible no-cloning strategies, and reach the conclusion 
that the optimal strategy is the one using symmetric cloning, which I presented in the previous 
section. 
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6.1.3 Remarks 

This first example models the situation in which we have a series of quantum computations 
with some computational steps Uq in common. We must note that we have assumed complete 
lack of knowledge about the intermediate state Uq |0) and about the final target states UjUo |0) 
(j = l,2,...M). 

In the general case this will not be a good assumption, as many quantum computations will 
output states known to belong to a limited set of states. In some cases, this can be taken into 
account with state-dependent quantum cloning and a different choice of scoring functions. In 
the next section we give an example of this. 

6.2 Example with probabilistic cloning 

In our second example we take the black-boxes of the previous example to consist of arbitrary 
quantum circuits that query a given function only once. The query of function /j is the unitary 
that performs 

\x)\y) -» \x) \y ® fi(x)) , 

where we have used the symbol © to represent the bitwise XOR operation. For ease of analysis, 
we restrict ourselves to the case M = 2 and also restrict the set of possible functions fo, /i and 
/2- Our task will involve determining two functionals, one which depends only on fo and /i, 
and the other on fo and fi. As in the previous example, we will compare the performances of 
cloning and no-cloning strategies. 

In order to precisely state our task, let us start by considering all functions h{ which take 
two bits to one bit. We may represent each such function with four bits a, b, c and d, writing 
h a ,b,c,d to represent the function h such that h(00) = a, h(01) = b, h(10) = c, and h(ll) = d. Let 
us now define some sets of functions that will be helpful in stating our task: 

SfQ = {^QOIOj hoioi, /iiooi }; 

•Si = {^0001, ^0010) ^-0100; ^100o}) 5*2 = {^0000) ^0011, ^0101) hiooi} 

Sfn = Si u S2, 

<Soooo = {^-ooooi him}, Soon = {hoon, /inoo} 5 (6-6) 
£0101 = {^oioi; ^1010}) 5*1001 = {/iiooi) ^-onoli (6.7) 
Sf = 5*0000 U 5oon U 5oioi U 5iooi- 

Now we randomly pick a function fo G Sjq, after which two other functions fx and fi are 
picked from the set Sm, also in a random fashion but obeying the constraints: 

/o®/i,/offi/26S/. (6.8) 

Here we use the symbol © to represent addition modulo 2, which is equivalent to the bitwise 
XOR operation. Constraints Q6.8| ) ensure that both functions fo © f± and /o © /2 are either 
balanced or constant. The idea of imposing this requirement comes from the Deutsch-Josza 
algorithm which works only for functions which are either balanced or constant. 

Our task will be to find in which of the four sets 5oooch 5oon, 5oioi and 5iooi he each of the 
functions /offi/i and /o©/2 ; using quantum circuits that query /o, /1 and /2 at most once each. 
Our score will be given by the average probability of successfully guessing both correctly. 
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Figure 6.2: If function /, is guaranteed to be either in set S\ or in S2, then this quantum circuit 
can be used to distinguish between the four possibilities in each set. This is done by measuring 
the final state \4>i) = X^Loo( — 1)^^ |^) i n one of two orthogonal bases, depending on which set 
contains fa. The H operations are Hadamard gates. 



6.2.1 Score without cloning 

The best no-cloning strategy I have found goes as follows. Firstly, note that if fa = /10010 then 
both fa and fa must be in set Si, because of the constraints given by eq. (|6.8|) ; similarly, if fa 
is either Zioioi or /iiooi, then fa and fa must be in set 5*2. Since we have drawn the function 
fa with a uniform random distribution, we will have both functions fa and fa in set S2 with 
probability p = 2/3. We will assume that this is the case; then we can discriminate between the 
two possibilities for fa with a single, classical function call. Furthermore, by using the quantum 
circuit in fig. ^ twice (once with each of fa and fa) we can distinguish the four possibilities 
for functions fa and fa. 

This happens because depending on which function in set S2 was queried, this quantum 
circuit results in one of the four orthogonal states 



11 



E(-d 



(6.9) 



=00 



This allows us to determine functions fa, fa and fa correctly with probability p = 2/3, in which 
case we can determine which sets contain fa © fa and fa © fa and accomplish our task. Even in 
the case where our initial assumption about fa was wrong, we may still have guessed the right 
sets by chance; a simple analysis shows that our chances of getting both right this way are only 
1/16. On average, then, by using this no-cloning strategy we obtain a score: 



Pi 



2 1 

3 + 3 



1 

16 



0.6875. 



This is the best no-cloning score I could find for this task, and I conjecture that it is optimal 
among no-cloning approaches. 



6.2.2 Score using cloning 

We can do better than that with quantum cloning. The idea now is to devise a quantum circuit 
that queries function fa only once, makes two clones of the resulting state and then queries 
functions fa and fa, one in each branch of the computation. Since we have some information 
about the state produced by one query of fa, the best cloning strate gy will no longer be a 



UQCM, but the probabilistic cloning process described in section 4.3.5 



The quantum circuit that we apply to solve this problem is given in figure |6.3| . 
Immediately after querying function fa, we have one of three possible linearly independent 
states (each corresponding to one of the possible /o's): 



CHAPTER 6. CLONING AND QUANTUM COMPUTATION 



54 



-• — ► 



|0> 
ID- 



•|0)-|1> 



c 

L 

N 
I 

N 
G 



H 









|flag> 



Figure 6.3: The cloning procedure in this circuit is probabilistic; a measurement on the state 
\Flag) tells us whether the cloning succeeded. If the cloning is successful we let the clones go 
through the rest of the circuit, yielding output states \<pi) = \ Y^x=qo(~ l)^ ^®-^^ \x), (i = 1,2). 
These states can be measured in the basis defined by eqs. (|6.15|) - (|6.18| ) to unambiguously decide 
which of the four sets Soooo, ^0011 > £0101 or Siooi contains /o © U- 



|/iooio) = 2[l°°) + l 01 )-| 10 ) + l 11 )]' 
Ifoioi) = ~ [|00> - |01) + 1 10) - |11)] , 
|/iiooi) = ^[-|00) + |01) + |10)-|ll)] 



(6.10) 
(6.11) 
(6.12) 



As we have seen in section 4.3.5, we can build probabilistic cloning machines with different 
cloning efficiencies for each of the states ( 6.10| )-( |B.12 ). Theorem 4.2 (due to Duan and Guo and 
reviewed in section 4.3.5) provides us with inequalities that allow us to derive achievable effi- 
ciencies for the probabilistic cloning process. I did a numerical search that yielded the following 
achievable efficiencies for probabilistically cloning the states in eqs. ( |6.10| )-( 6.12j ): 



71 = 7(1 fcooio)) = 0.14165, 

72 = 7(l^oioi» = 7(^1001)) = 0.57122. 



(6.13) 
(6.14) 



After the cloning process we can measure a 'flag' subsystem and know whether the cloning 
was successful or not. For this particular cloning process, the probability of success is, on 
average, p SUC cess = (71 + 272)/3 ~ 0.4280. Let us suppose that it was successful. Then each of 
the cloning branches goes through the second part of the circuit in fig. |T^, to yield one of the 
four orthogonal states: 



l^oooo) = i[|00) + |01) + |10) + |ll)], 



1 



1 10) 



Ifcoon) = 5 0°°> + I 01 ) 
\hom) = \ [|00> - |01) + 1 10) - 
l^iooi) = \ [- |00) + |01) + |10) 



11)], 
11)], 
-|H)] 



(6.15) 
(6.16) 
(6.17) 
(6.18) 

if and only if the combined 



which can be discriminated unambiguously. We obtain state |/ioooo) 
function /offi/j is one of the two in set 6*0000 , as can be checked by calculating the effect of the cir- 
cuit in fig. |6.3| for all possible /o, fi an d f2- The situation is similar for the other three states; the 
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detection of each of them signals precisely which one of the four sets {SoooO; ^ooii> £oioi> 5*1001 } 
contains fo © fi . As a result, if the cloning process is successful, we manage to accomplish our 
task. 

However, the cloning process will fail with probability (1 — ^success)- If this happens, a simple 
evaluation of the posterior probabilities for function fo shows that it is more likely to be feppi o 
than the other two, thanks to the relatively low cloning efficiency for the state in eq. (|6.10 ), 
in relation to the states in eqs. ( 6.11 ) and ( |6.12| ) [see eqs. ( |6. 13 - 6. 14 )]. If we then guess that 
fo = ^ooiO) we will be right with probability 



Pooio 



(l-7i) 



(l-7i) + 2(l- 72 ) 



0.5002. 



What is more, we are still free to design quantum circuits to obtain information about fx 
and /2, since at this stage we still have not queried them. Given our guess that fo = /looiO; only 
the four functions in S\ can be candidates for f\ and /2, because of the constraints given by eq. 
d6.8| ). These four possibilities can be discriminated unambiguously by running a circuit like that 
of fig. 



6.2 



twice, once with fi and once with fi- The circuit produces one of four orthogonal 
states, each corresponding to one of the four possibilities for /j. Therefore, if our guess that 
fo = ^-ooio was correct, we are able to find the correct fi and /2 and therefore accomplish our 
task. In the case that fo ^ Ziooio after all, we may still have guessed the right sets by chance; a 
simple analysis shows that this will happen with probability 1/16. 

The above considerations lead to an overall probability of success given by 



P2 — Psuccess ~\~ (1 ^success) 

~ 0.7320 > pi = 0.6875, 



. 1 

Pooio + (1 -Pooio)t^ 
lb 



(6.19) 



thus showing that our cloning approach is more efficient than the previous one, which does not 
use cloning. I have not proven that the first approach is the most efficient among those that do 
not resort to cloning, but I conjecture that it is. 



6.2.3 Remarks 

Besides the larger probability of obtaining the correct result, our cloning approach offers another 
advantage: the measurement of the 'flag' state allows us to be confident about having the correct 
result in a larger fraction of our attempts. 

For the probabilistic cloning machines described above this fraction was ~ 0.428, but this can 
be improved by choosing a different cloning machine, characterised by 71 = 0.3485,72 = 0.5258. 
This latter machine signals a guaranteed correct result in a fraction (71 + 272)/3 ~ 0.467 of the 
runs. 

The best no-cloning approach for obtaining these guaranteed correct results would involve 
unambiguous discrimination of the function fo, followed by the distinction among the four 
possibilities for functions f\ and fi (this second step is simple if we know fo for certain). 
Theorem 4.3 (see section 4.3.5| ), due to Duan and Guo, provides us with a tool to numerically 



determine the best efficiency for unambiguous discrimination of fo- A numerical search shows 
that this can be done only with efficiency < 1/3, and therefore this is the limit for the fraction 
of runs for which we can obtain a guaranteed correct result for the task at hand, if we do not 
resort to cloning. 



6.3 Chapter summary 

I presented two examples of quantum computation tasks which attain optimal performance by 
using an intermediate quantum cloning step. In both examples, I showed that preserving and 
distributing quantum information through a cloning process offers advantages over extracting 
classical information mid-way in the quantum computation, for example using optimal state 
estimation. 
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The first example models the situation in which we need to do M quantum computations 
which have a first part in common. When this first part is lengthy or otherwise costly to 
compute, we can obtain good approximations of all M ideal output states by running the first 
part only once, cloning the resulting state and feeding the clones to the multiple subsequent 
computations. It was shown that the average fidelity of the resulting approximate outputs is 
maximised when we use an optimal, symmetric UQCM. 

When performing quantum computations, we may often need to distribute quantum informa- 
tion contained in a state about which we have some partial information. In such cases, UQCM's 
may not be the optimal solution. The second example I presented shows such a case, in which 
we use a probabilistic, state-dependent cloning procedure to distribute quantum information in 
the middle of a quantum computation. 

Of course, general quantum algorithms are already known to manipulate quantum infor- 
mation, distributing it among different parts of the quantum memory during a computation. 
In this chapter I have shown that quantum cloning can be taken as a natural tool to do this 
quantum information distribution, in order to optimise our use of computational resources. It 
would be interesting to find other tasks that could profit from cloning, perhaps by combining 
already known quantum algorithms with some intermediate cloning steps. 



Chapter 7 

On quantum random access codes 



In section |4.2| I described a quantum information application called quantum random access 
codes (QRAC's). In this chapter I will analyse some aspects of this application, stressing the 
fundamental quantum characteristics behind its better-than-classical performance for a type of 
information p roce ssing task. 

In section 7.1 I will show the role quantum contextuality plays in the simplest QRAC. In 
section 7.2 I change the QRAC somewhat to show how it can make use of quantum non-locality 

instead. Next, in section [7^ we will see the relation between the 2 1 QRAC and two quantum 
information protocols known as remote state preparation and dense coding. 

The simplicity of the 2 — ► 1 QRAC suggests it may have been already derived in other 



contexts, with other interests in view. In section |7.4| I will show that the quantum communication 
complexity protocol introduced by Buhrman, Cleve and van Dam in |35| is actually equivalent to 
this simple QRAC. This shows the two apparently dissimilar protocols are actually one and the 
same, and rely on the same quantum characteristics for their higher-than-classical performance. 

Next I will derive some bounds on general QRAC's efficiency di rect ly from the invariant 
information of Brukn er a nd Zeilinger, which we overviewed in section |2,5| . 

Finally, in section |7.6| I will briefly sketch some of the experimental requirements for demon- 
strating the simplest QRAC's in the laboratory. 



7.1 QRAC's and contextuality 

In this section I will show how the higher-than-classical performance of the simplest QRAC can 
be traced back to quantum contextuality. Later we will see its relation to quantum non-locality 
as well. 



In section |4.2.1| we have seen that the simplest classical random access code consists of asking 
Alice to encode her two-bit string 60^1 into a single-bit message, which Bob must decode to read 
one of Alice's bits, but not both at the same time. As we have seen, the optimal classical 
probability can be attained through the obvious protocol consisting of just sending one of the 
bits, which Bob can then always 'decode'. If he is required to read the unsent bit, then he just 
guesses it, getting it right half of the time. This trivial code results in an average probability of 
success of p c = \ (l + |) = |. 

We have also seen how quantum mechanics enables us to perform the task with a higher 
probability of success by sending a qubit, instead of a classical bit. Alice prepares one of four 
non-orthogonal states of a qubit depending on her two-bit string, and sends it to Bob. By 
appropriate measurements on this single qubit, Bob manages to read out Alice's first bit or the 



second, but not both, with a high probability of success p q ~ 0.85 (see figure 4.1). 

The preparation of the encoding state can be done in a two-step process, in which Alice first 
creates an entangled state of dimension d > 4, and then prepares a qubit state through suitable 
projective measurements on a subsystem. 
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Let us now investigate one possible way of doing this, which will make apparent the role 
of quantum contextuality in the higher-than-classical performance of this task. Alice starts by 
preparing 

k + ) = ^(\0A0 B ) + \lAl B )). (7.1) 



Let us use the same qubit parametrisation as in eq. (|2,9[): 

|V(M)> = cos(|)|0)+exp(^)sin(|)|l). (7.2) 
In order to prepare qubit B in the desired state, she measures qubit A with one of the two 



observables used for maximal violation of the CHSH inequality (see section 2.3.1). As discussed 
in section 2.3. 1| , these projectors must alternate on the same plane in the Bloch sphere, separated 



by angles of 7r/4 radians. For ease of comparison with fig. 4.1 depicting the encoding states, 
we can choose the following projectors. If Alice's bit-string fto^i = 00 or 11 she will measure 
projector P a 2 = P(tt/2,7t/A); if fro^i = 01 or 10 she measures P a \ = P(ir/2, 7tt/A). She then 
performs a unitary rotation by tt radians around the z axis on qubit B, conditional on the 
outcome of her measurement. This enables her to deterministically prepare any of the four 
states used in the protocol. 

As in the CHSH test, Bob's readout measurements are made using observables different from 
those used by Alice to prepare the states. He will measure qubit B with projector P&i = P(tt/2, 0) 
if he want to read bit bi, and P^ = P(tt/2,7t/2) if he want to read bit &2- As we discussed in 



section 4.2.1, the measurement outcomes give him a 2— >1 QRAC a probability of success which 
is higher than the optimal classical analogue. 

This way of presenting the 2— >1 QRAC shows that its higher-than-classical performance 
has something to do with a CHSH inequality test. What exactly is the relation? Since there 
are no simultaneous measurements of different observables here, we cannot claim here that the 
quantum advantage comes from non-locality. 

Now I will show that the quantum advantage comes directly from contextuality. First, let 



us note that when performing the protocol N times on multiple pairs of qubits in state (7.1) 
about N/2 times Alice will need to do the ir rotation on Bob's qubit, and she will not in the 
other half of the particles. 

The subset of pairs not rotated by tt remain in state ([7. 1|) , and the measurements performed 
are those that maximally violate the CHSH inequality. Since a strict locality condition was not 
imposed here, the CHSH inequality violation in this case points towards violation of a weaker 
assumption, that of non-contextuality of experiment outcomes. 

It is easy to check that a tt rotation around the z-axis of qubit B takes state (f/M| ) into 

|^-> = -^(I°a0b>-|1a1b». (7-3) 

It is true that Alice's measurement happens before Bob's, and adherents of the 'state collapse' 
interpretation of quantum theory can say the state is no longer Q7.3j ) when Bob measures it. 
Nevertheless, the measurement statistics and correlations are the same as those obtainable from 
state ( f^3 ) under strict locality conditions. This state also violates maximally the CHSH in- 
equality^ 2l]) . 

Note that while each subset maximally violates the CHSH inequality, taken together there is 
no violation. This, however, is no problem, since Alice and Bob actually need to get together and 
compare notes in order to verify the stronger-than-classical correlations - then they can check 
whether a rotation was performed or not, and accordingly which state was effectively measured. 
This naturally leads to separate analyses of the two subsets, resulting in a maximal violation of 
the CHSH inequality. 

We have just seen that the measurements violate the CHSH inequality, but still are necessar- 
ily local, as they depend on Alice preparing qubit B and then sending it to Bob. As we have seen 



in section |2.3| , quantum non-locality tests are a special type of contextuality tests. In particular, 
when any non-locality test is performed with time-like separation between the measurements, we 
cannot rule out a local explanation for the outcomes. But despite being compatible with a local 
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hidden-variable explanation, the observed outcomes violate the non-contextuality hypothesis. 
In other words, the time- like separated CHSH inequality measurements involved in this QRAC 
protocol can be interpreted as a contextuality test. For further discussion on experimental fea- 
sibil ity of con textuality tests which exploit this relation between non-locality and contextuality 
see fl77j|l46j . 

Next we will have a look at a simple modification of the protocol, which turns it into an 
equivalent of a CHSH non-locality test. 



7.2 QRAC's and non-locality 

Taking into consideration our discussion above of the 2— >1 QRAC, it is easy to change the 
protocol somewhat and obtain the equivalent of a genuine CHSH inequality test. 

The QRAC we presented above takes its advantage from using quantum instead of classical 
communication. There is another way of obtaining a quantum version of these codes, by keeping 
the classical communication, but using the resource represented by measurements on previously 
shared entangled states. By exploring the strong correlations between measurements on these 
entangled pairs of particles, the parties can save on classical communication use. 

The simplest such entanglement-based QRAC is again a 2^1 quantum code using classical 
communication of one bit from Alice to Bob (the same amount as the classical code), plus 



measurements on one previously shared maximally entangled state (7.1). The measurements 
are those presented above, i.e. the same as for a CHSH test. The difference from the QRAC 
protocol described in the previous section is that here Alice never performs the conditional 7r 
rotation. This enables them to share the state beforehand, and even do all measurements under 
strict locality conditions (i.e. with space-like separation). As we have seen, however, half of 
the time Bob gets exactly the opposite result of what he should get, for a successful quantum 
protocol. 

It is here that the classical communication part of the protocol comes in. With a single bit of 
communication, Alice can tell Bob whether he should have performed the ir rotation she did not 
have a chance to. While he cannot physically do this operation on his qubit after measurement, 
he can obtain the correct outcome just by flipping whatever outcome he got, conditionally on 
Alice's one-bit classical message. By doing this, he recovers the outcomes he would have obtained 
using the four-state encoding of the quantum-communication version of the QRAC. This enables 
him to succeed with a higher-than-classical probability of success p q = cos 2 (-7r/8) ~ 0.85. 

We thus see that it is the non-locality of the measurement outcomes on a maximally entangled 
state that allows for an entanglement-based QRAC with higher-than-classical performance. 



7.3 Relations with other applications 

We have just seen that by using a single bit of classical communication plus entanglement, Alice 
and Bob could contrive a protocol enabling Bob to deterministically obtain the precise state he 
requires for a QRAC with high probability of success. This is a specific example of a general 
process known as remote state preparation. 

Remote state preparation is a protocol by which Bob is able to recover an arbitrary quantum 
state known to Alice, by suitable measurements on some previously shared entangled states, plus 
classical communication only. The simple protocol we have used above to remotely prepare one 
of the four states necessary for the QRAC is actually sufficient to prepare any state on the 



equator of the Bloch sphere. This was pointed out by Pati in [152]. 

Some more general remote state preparation protocols were developed by Bennett and co- 
workers in fT^| . These include protocols for general qubit states (not necessarily on the Bloch 
sphere equator) and general higher-dimensional systems. 

There is also a relation between the 2^1 QRAC and a process known as quantum dense 
coding pq[ . Quantum dense coding allows Alice to send Bob two bits of information with the 
transmission of a single qubit, supplemented by previously shared entanglement. Let us see how 
we can use dense coding in a type of QRAC, and the caveats involved. 
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Dense coding is possible because a two-qubit Bell state can be locally transformed into any 
of the four orthogonal Bell states at will. After sharing a Bell state with Bob, Alice can encode 
a two-bit value by locally transforming the two-qubit state into one out of the four possible 
orthogonal Bell states. She can then send her qubit to Bob through a quantum channel. With 
both qubits in his hands, he can discriminate the four orthogonal states and read Alice's two-bit 
message. 

In the previous sections we have seen two ways of implementing a 2— >1 QRAC: either by 
sending a single qubit from Alice to Bob, or by sending a bit encoding the outcome of a local 
measurement on previously shared entangled qubits. Dense coding offers a third way of perform- 
ing a QRAC, which combines the ingredients of these two approaches. Using dense coding, Alice 
can send a two-bit message to Bob using both shared entanglement and qubit communication. 
Interestingly, with both resources they can do better than with either isolated resource, because 
now Bob can decode both bits with probability p q = 1. 

In order to do so, however, Bob's decoding procedure requires Bell basis discrimination, and 
this is hard to do experimentally with photons, as a coupling between them is required. For this 
reason, in section 7.6 below I will only consider the experimental requirements for a realization 
of the 2— >1 QRAC using either entanglement or qubit communication. 



7.4 QRAC's and quantum communication complexity 

In this section we will see that an altogether different quantum information processing protocol 
is actually equivalent to the simple 2— >1 QRAC we have been discussing. In [35] Buhrman, Cleve 
and van Dam introduced a two-party quantum communication complexity protocol which was 
shown to be better than its optimal classical counterpart. Let us now review this communication 
complexity task, and show how the 2— >1 QRAC solves it. 



7.4.1 A simple two-party communication complexity task 

Let us now review the communication complexity problem proposed in |35|| . Let Alice and Bob 
each have two-bit strings x = x\x 2 and y = y\V2, respectively. Alice is allowed to send just a 
single bit to Bob, who must then compute: 

g(x,y) = xi ®yi®{x 2 A y 2 ), (7.4) 

where © denotes sum modulo 2, and A is just the AND Boolean operator. The bit-strings x 
and y are drawn from a random uniform distribution. A quantum protocol for this problem 
was presented in |3l| as the first two-party example of entanglement-enhanced communication 
complexity. 

It is clear that a single bit of communication from Alice to Bob is not enough for him to always 
compute g(x, y) successfully. What interests us now is to find the probability of obtaining the 
corre ct value of g(x, y) in three communication complexity scenarios we have sketched in section 
[4.1.1| . These involve using: a bit of communication (classical model); a qubit of communication 
(quantum-communication model); or a bit of communication plus measurements on entangled 
pairs (entanglement-based model). 

In the next section I will show that this communication complexity task is equivalent to a 
2 — ► 1 random access code, and therefore also has an optimal classical probability of success 
Pc = 3/4. 



7.4.2 Quantum protocols 

Here we will see that the 2— >1 QRAC we have studied above can be used to solve this commu- 
nication complexity problem. 

What exactly must Bob know in order to solve the task? He has some partial information 
about function g(x, y), given by his own data y. For each value of y, the table below shows what 
g(x,y) reduces to, and lists one particular bit value that would enable Bob to compute g(x,y): 
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y 


g(x, y) = x\ © yi © (x 2 A y 2 ) 


Bob needs to know 


00 


Xi 


El 


01 


Xx © 1 


El 


10 


X © Xl 


x © a?i 


11 


X ©Xi © 1 


x © Xl 



We see that Bob can compute g(x,y) if there is some way for him to choose which bit to 
read: either x\ or xq © x\. This can be achieved with average probability p q = cos 2 (-7r/8) with 
the 2— >1 QRAC I have discussed above. As we have seen in sections 7.1 and |7.2|, this can be 



done either by performing measurements on a maximally entangled pair of qubits (and sending 
a bit of classical communication) or by directly sending a qubit instead of a classical bit from 
Alice to Bob. These are the entanglement-based and the quantum-communication protocols, 
respectively. 

This solution using the 2— >1 QRAC is equivalent to the original entanglement-based solution 
presented in [35]. Its advantage is that it uncovers more clearly the real nature of the problem. 
Now it is easier to see that this quantum communication complexity protocol also derives its 
higher-than-classical performance from quantum contextuality and non-locality. 



7.5 Invariant information and QRAC's 



In section [2.5| we have reviewed the definition and some properties of the invariant quantum 
information Itotai introduced by Brukner and Zeilinger. In that introduction, I stressed that 
hotai could be motivated with a very simple application in mind, which involved quantifying the 
entropy, or information, of mutually exclusive measurements that can be performed on a single 
quantum system. The measurements we considered were projective measurements on different 
mutually unbiased bases (MUB's) []. Motivated by the appearance of two MUB's for a qubit 
in the preceding sections of this chapter, here we will investigate what Itotai can tell us about 
QRAC's. 

The most symmetrical QRAC construction involves having Bob decode the state sent by 
Alice through projection on a set of alternative MUB's. The choice of MUB is made depending 
on which information Bob wants to decode from the state sent by Alice. The simple 2 — ► 1 QRAC 



presented in section 4.2.1 is an example of that. Alice encodes her two bits of information into 
four states, each of which has a high overlap one basis vector from each MUB. At the same time, 
each encoding state has a small overlap with the other two vectors, representing the outcomes 
that Alice does not want Bob to obtain. As we have seen, this simple scheme can reach a 
probability of success p q = cos 2 (7r/8) ~ 0.85, which is higher than the optimal classical protocol 
(which achieves only p c = 0.75). 

If we did not know about this 2 — ► 1 QRAC, however, it would still be possible to obtain 
a tigh t bound for p q just by using Brukner and Zeilinger's invariant information Itotai [see eq. 



(|2.31| )1. In this section we will see how Itotai can be used to obtain bounds on the probability of 



success of a wide range of QRAC's. 



7.5.1 Bounding the efficiency of simple QRAC's 



To obtain a bound on th e quantum probability of success p q for the 2 — > 1 qubit code we have 



analysed in section 4.2.1, let us start by recalling the expression of Itotai for measurements over 



two MUB's for a qubit: 

2 2 



hotai = E = EE (p? - 1) > ( 7 - 5 ) 

j=l j=l 1=1 ^ ' 

(i) 

where p\ are the probabilities of the projection over MUB j to have outcome i. Let us assume 
there exists an encoding which maximises this information (achieving Itotai = 1/2), by equally 



1 For the definition of MUB see section 2.5.2 
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Figure 7.1: Encoding states for the 3 — > 1 QRAC using a single qubit. Alice prepares one 
out of eight states on the vertices of a cube inscribed within the Bloch sphere, depending on 
her three-bit string. The angle 9 is such that cos (0/2) = 1/2 + a/(3)/6 ~ 0.79, which is the 
probability of Bob correctly decoding a single bit out of the three. 



distributing the information over the two alternative measurements. This would mean that the 
information for each measurement would be half the maximal, or / = 1/4. This is achieved 
when the probability of success is 



I(p, l-p) = (p- 1/2) 2 + ((1 -p)- 1/2) 2 = 1/4 ^ p 



1 y/2 

2 + ~ 



0.85. 



(7.6) 



This is actually achieved by the 2^1 QRAC we have reviewed in section 4.2.1. We thus see 
that equally distributing the invariant information over the two MUB's automatically allows us 
to compute a tight bound for the efficiency of the 2 — ► 1 QRAC we have discussed previously. 
This also shows that this QRAC is optimal. 

We can also apply this simple reasoning to a 3 — > 1 QRAC which encodes three bits into a 
single qubit. This QRAC is a straightforward extension of the 2 — > 1 QRAC from the equator 
to the whole surface of the Bloch sphere. It was attributed to Chuang in reference |J. 

With three bits, Alice has 2 3 = 8 possible bit-strings 60^1^2- ^or eacri possibility she will 
prepare one particular state from the set depicted in figure 7A. These states lie on the vertices 
of a cube inscribed within the Bloch sphere. If Bob wants to read out bit bo he measures along 
the x-axis and associates a positive result with bo = 0. To read bits b\ [62] he measures along 
the y-axis [z-axis] and again associates a positive result with b\ = [62 = 0]. It is easy to see 
Bob's probability of suc cess is given by p q = cos ((9/2) = 1/2 + v / (3)/6 ~ 0.79, where the angle 



9 is given in figure 7T 
probability p, 



The optimal classical 3 — ► 1 random access code succeeds only with 
0.75, as can be checked easily through a search over all deterministic protocols. 
Also in this case, an equipartition of the invariant information Itotal over the three MUB's 
bounds p q tightly. Dividing the total information of a qubit by three we get that each two- 
outcome measurement can have at most 1 = ^ = 1/6. To find explicitly the bound on p\ from 
the invariant information, we just need to solve: 



I = (p - 1/2) 2 + ((1 - p) - 1/2) 2 = 1/6 p = I + 7(3)/6 ~ 0.79, 



(7.7) 



which shows that equipartition of the invariant information over the three MUB's again gives a 
tight bound on the efficiency of a 3 — > 1 QRAC. 
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7.5.2 More general bounds 

We can now ask whether Itotai can be used to obtain tight bounds for more general QRAC's. 
In this section we will see that in the case of two and three MUB's for a qubit, it is a mere 
geometrical coincidence that the maximum overlaps between encoding and decoding states given 
by the invariant information can be achieved. In higher-dimensional Hilbert spaces there will not 
necessarily exist such symmetries allowing us to find encoding states with the desired overlaps 
with the decoding projective measurements. This difficulty can be illustrated with a simple 
example. 

Let us now work out a QRAC which encodes two trits (i.e. three-level systems) into a single 
qutrit. Bob's measurements will consist of three-outcome projective measurements over two 
MUB's. Alice's encoding procedure involves preparing one of nine states, depending on the two 
trits she has. 

Let us use two MUB's for a qutrit presented in fTTJ. They are given by: 



\a x ) = (u\Q) + |1) + |2))A/3, 

|A) = (|0)+w|l) + |2))/V3 ) (7.8) 
| 7 i) = (|0> + |l)+H2))A/3, 

and 

|q 2 ) = (oj*|0> + ]1> + |2>)/v / 3, 

|A> = (|0>+ W *|1> + |2»/V3, (7.9) 
| 72 ) = (|0> + |1)+^|2))/V3, 

where u = exp(2-7ri/3). 

We can search now for the encoding states Alice will use. To start with, we must find a state 
which maximises the sum of the overlaps with \a±) and \cx.2). A state with this property is: 



|VW 2 > = o|0) + V(l - o 2 )/2|l) + - a 2 )/2|2), (7.10) 

witho = -(j + ^)'. 

For the decoding procedure, Bob will project the qutrit into the first or second basis depend- 
ing on which trit he wants to read. The state ( 7.10| ) above will result in an outcome \a\) or |a 2 ) 
with probability 

1 /3 

I (^ QlQ >l) I 2 = I (^aalai) I 2 = 7, + "V - °- 789 - ( 7 - U ) 

2 6 

Given the symmetries present in the two MUB's above, it is easy to design unitaries which 
we can apply to state ( [7.10[ ) to obtain the other eight encoding states, each of which has the 
same large overlap with one basis state from each MUB. Let us now define two unitaries which 
will be useful for that: 





V : { |1)-H2) ,&ndU:< . (7.12) 



The unitaries U and V above transform the basis states in t he tw o MUB's in a useful way. We 
can apply some combinat ions of them on the encoding state ( |7.10| ) to obtain the other encoding 



states, as we see in table 7.1 



Together with the two MUB's above, these nine states form a 2 — » 1 QRAC for trits which 
is successful with probability p q = 1/2 + \/3/6 — 0.79. We will now see that this p q is smaller 
than the bound obtained from the invariant information. 

If we partition the total information content of a qutrit (Itotai = 2/3) into two equal parts 
corresponding to each MUB, we should have a maximum of I = 1/3 for each three-outcome 
measurement. The largest probability p compatible with I = 1/3 is given by 

,-3 +2 ^-1/3 =3^= 3 + — 0.805, (7.13) 
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Table 7.1: Encoding states for 2 — > 1 QRAC with a qutrit 



Unitary applied to "00102) 


Encoding state obtained 


11 


IV , OlQ 2 ) 


V 




V 2 


1 "071 72) 


u 


1*071^2 ) 


vu 




v 2 u 


I0/31O2) 


u 2 


I#17 2 ) 


vu' 2 


1 "071 02 ) 


V 2 U 2 


|0oi/3 2 ) 



but this is larger than p q = 1/2 + v3/6 — 0.789 found for the QRAC described above. 

This shows that other considerations can get into the way of quantum random access codes, 
other than the bounds imposed by the invariant information of Brukner and Zeilinger. The 
geometry of the set of pure states for d > 3 is quite complicated, and ultimately limits the 
construction of QRAC's, as the example above shows. This difficulty can show up in different 
ways. In general, it may not be possible to share the invariant information equally among just 
two MUB's. Even if that is possible, this may require two high-probability outcomes together 
with a third low-probability outcome. This is not favourable for the QRAC, as it ideally requires 
just one high-probability outcome to guarantee a high decoding probability for Bob. With just 
one high-probability outcome, the optimisation done above shows it is impossible to find states 
which achieve the optimal sharing of Itotai between two MUB's. 

Much research has been reported recently on quantum information protocols using systems 
of dimensionality d > 3 |}7, 45, 30, 44 1. It has been shown t hat i n some cases, it could be possible 
to implement such protocols with low detection efficiency [140]. The 2 — ► 1 QRAC with qutrits 



presented above represents an initial effort towards developing QRAC's for higher-dimensional 
systems. It would be interesting to find how the gap between quantum and classical pe rform ance 
behaves as d increases. Some results along these lines have already been reported in [148]. 

QRAC's offer a genuinely quantum way of encoding information, which naturally imposes 
restrictions on how much of it can be read out. This limitation of the quantum readout appears 
because of the non-commutativity of the decoding measurements. Instead of considering this as 
a problem, here it is actually used to obtain a higher-than-classical efficiency for this task. This 
seems to be a general idea behind the development of many quantum information applications - 
we start from an apparent problem or curiosity of quantum theory, but turn the understanding 
of these characteristics into useful quantum information applications. 



7.6 Experimental feasibility 

In previous sections we have seen that the simple 2— >1 QRAC is effectively a demonstration of 
quantum contextuality, and if performed using entanglement, of non-locality as well. Besides 
illustrating these fundamental characteristics of quantum theory, we have seen that this QRAC 
is also behind the higher-than-classical performance of another information processing task, the 
communication complexity problem we discussed in section [7.4| . Taking all this into account, 
it is natural for us to ask about the experimental difficulty of implementing the 2— >1 QRAC. 
In this section we will analyse the experimental requirements necessary for implementing this 
simple quantum protocol. 
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7.6.1 Detection efficiency and noise rate 

Our analysis of the experimental difficulties of implementing the QRAC starts by discussing 
the two main limitations for these quantum experiments: finite detection efficiency rj and non- 
zero background rate 1 — \i. In all the discussion on experimental feasibility in this thesis, we 
will consider protocols involving one or more detections of a single qubit only, which involve 
discrimination between two given orthogonal states. 

Let us now define more precisely what we mean by detection efficiency rj. The quantum 
protocols require the parties to project the qubit along a certain basis of two orthogonal states. 
By detection efficiency I mean the fraction of events for which the detecting apparatus clicks, 
successfully telling apart the two orthogonal states we are interested in. In a typical quantum 
optical experiment, the detection efficiency depends basically on the quantum efficiency of single- 
photon detectors, which are usually of the order of a few percent only. 

We should also be concerned about background noise, i.e. spurious detector 'clicks' arising 
from background photons or detector noise. We can account for these by introducing a factor 
fj,, defined such that (1 — fj,) is the fraction of detections at each party due to the background 
or detector noise. We assume that the noise is uniformly random, yielding each of the two 
measurement outcomes with probability p = 1/2. Note that if these outcomes are found to 
deviate from uniform randomness in an actual experiment, then we would be able to use the 
regularity to improve our protocol; in other words, a uniformly random background is a worst- 
case assumption. 

Imperfections in the state preparation procedure can also be incorporated in our model 
through the parameter /i. For example, suppose Alice needs to send state |0) to Bob, but only 
manages to prepare an imperfect state. Suppose further that they perform many experiments 
to characterise exactly the state that she does manage to produce, and they find out that with 
probability po the outcome is what she intended (i.e |0)), but with probability (1 — po) it is the 
opposite. Then, for all practical purposes, we can model the situation by assuming she actually 
prepares state 

p = / x|0)(0| + (l (7.14) 

where /i = 2po — 1 is the random background rate. A similar procedure can be used in the case 
of imperfect preparation of maximally entangled states, used in the entanglement-based QRAC 
we have discussed. 



7.6.2 Feasibility of qubit communication protocol 

In the original QRAC protocol Alice communicates a single qubit to Bob, preparing it in one of 
four states depending on her two-bit string. The quantum communication involved suggests that 
this protocol can be implemented with photons. Alice sends Bob a photon whose polarisation 
encodes the qubit. 

Sometimes the protocol will work and sometimes not. With probability r//x Bob will suc- 
cessfully measure a signal photon, getting the right result with probability p q ~ 0.854, as we 
have seen in section 7.4.2. Whenever this does not happen, he will succeed with probability 1/2, 
either because he had to guess after a detector failure, or because he measured a background 
photon. Taking all into account, for a higher-than-classical performance of the 2^1 QRAC it 
is enough to have: 

r]fip q + (l-rin)->p c , (7.15) 
or, plugging in p q = cos 2 (-7r/8) and p c = 3/4, 



M > 



V2 
2tj ' 



(7.16) 



If there is no background {(j. = 1), we see that it is sufficient to have a detection efficiency 
7] > \/2/2 — 0.71. The analysis above was first made by van Dam in [187] for the communication 
complexity protocol presented in section 7.4, but restricted to the cases r\ = or fj, = 1. 
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If an experiment fulfilling condition ( 7.16Q is performed, then we will have a one-qubit quan- 
tum protocol which outperforms the optimal classical protocol using just one classical bit. These 
conditions, however, canno t ye t be reached in existing quantum optical experiments (for a state- 
of-the-art experiment see [ [L25| ). In the next chapter we will analyse a similar problem whose 
quantum protocol can achieve a higher-than-classical performance in a currently feasible exper- 
iment. 



7.6.3 Feasibility of entanglement-based protocol 

As we have seen in section the entanglement-based protocol is equivalent to a CHSH in- 
equality test. When analysing the feasibility of a CHSH test, some rather complex arguments 
are necessary to find the conditions on r/ and fi that exclude any hidden-variable theory expla- 
nation (see |130(| ). Interestingly, I will show these conditions can be obtained directly from the 
analysis of the entanglement-based QRAC, by requiring that it must have a higher-than-classical 
probability of success. 

First I present the case of limited efficiency r/ < 1 but /i = 1 (no noise but limited detection 
efficiency), as discussed by van Dam [187] in the context of the communication complexity 



problem reviewed in section 7.4.1 



Before trying the quantum protocol, Alice and Bob must agree on a procedure for the case 
when their detectors do not fire. Of course, they are not allowed to communicate the failure, as 
this would constitute some further communication beyond the allowed one bit. The most effective 
procedure is for Alice and Bob to use the optimal classical protocol whenever their detectors 
fail. In the case of a single detector failure, Bob's guess will still be correct with p = 1/2. When 
both detectors fail, however, they will still obtain an optimal classical probability of success 
p c = 3/4. Assuming independent errors at the two detectors, we can calculate the minimum 
detector efficiency needed: 

r] 2 p q + (1 - T]) 2 Pc + r/(l - r/)- + (1 - r/)r/^ > p c 
2p c - 1 

=>V> : 7- (7.17) 

Pq + Pc ~ 1 

Our 2— >1 QRAC has p q = cos 2 (7r/8) and p c = 3/4, which results in a minimum necessary 
detector efficiency 



r/min = 2(V2 - 1) ~ 0.828, (7.18) 

which is also the minimum detector efficiency necessary for a loophole- free CHSH inequality test 
@. 

To take into account the probability (1 — fj,) of each detector firing due to a background 
event, we can classify each run of the experiment into one of three classes: 

1) Alice and Bob measure their particles accurately with probability r/ 2 /x 2 , in which case 
they apply the quantum protocol and succeed with probability p q = cos 2 (7r/8). 

2) Both their detectors fail to detect anything, which will happen in a fraction (1 — r/) 2 of 
the runs. In this situation they will both use the classical protocol with a probability of success 
Pc 3/4. 

3) In all the other situations Bob will use either the quantum or the classical protocol, 
depending on whether he detects a photon or not. However, due to either background or lack 
of detection at Alice's side his guesses will be random, yielding a success rate of only 1/2. 

Taking all this into account, the condition for a quantum protocol which is better than the 
optimal classical one is: 

tfl± 2 p q + (1 - rj) 2 Pc + (1 - r/V - (1 - rj) 2 )^ > p c . (7.19) 



Plugging in the values for p q and p c we obtain 




(7.20) 
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Figure 7.2: The shaded area indicates the region where the background level (1 — fj,) and de- 
tector efficiency rj allow for an entanglement-based 2 — * 1 QRAC with better-than-classical 
performance. The area corresponds to that given by inequality ( 7.20| ). 



In particular, for perfect detectors (rj = 1), we need [i > 2~V 4 ~ 0.841. In fi gure |7.2| I represent 
the region in the r\ x \x parameter space that guarantees a higher-than-classical success rate. 

The favourable ry x /i region coincides exactl y wit h the conditions necessary for a violation of 
the CHSH inequality (as can be gathered from [132], for example). This is not a surpris e, g iven 
the equivalence between the quantum protocol and a CHSH test pointed out in section |7.2| . 

Interestingly, we have obtained information on how hard it is to experimentally establish 
quantum non-locality by a simple analysis of a quantum information processing application. 
It is important to remember that to beat the optimal classical protocol it is not necessary to 
enforce strict locality for the measurements. It is enough to obtain the stronger-than-classical 
correlations with time-like separation only. Under these conditions, the QRAC becomes a quan- 
tum contextuality test, and derives its higher-than-classical performance from this quantum 
characteristic. 

A first candidate for experimental implementation of the protocol would be entangled pho- 
tons generated by a parametric down conversion crystal. The CHSH test with entangled photons 
which achieved the highest detection efficiency to date seems to be the one reported by Wein- 
furter's group [125], the detection efficiency r/ being slightly over 28%. 

Maximally entangled pairs of photons can be generated with high fidelity and measured with 
precision, which guarantees the equivalent of a very high /i. However, detector efficiencies r\ are 
still short of those necessary. Ion traps techniques, on the other hand, can reach high r] and // 
[ |166| ]. This makes ion traps a natural choice for the experimental demonstration of the simple 
quantum random access code presented here. 



7.7 Chapter summary 

In this chapter I have analysed different aspects of some of the simplest quantum random access 
codes. We have seen that the simplest of them, the 2^1 QRAC with a qubit, can be shown 
to derive its performance either from quantum contextuality or from non-locality, depending on 
how the protocol is implemented. 
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I have shown how the simplest QRAC solves a two-party communication complexity problem 
proposed in [35|. This shows that the higher-than-classical performance of that protocol can 
also be attributed to contextuality and non-locality. 

We ha ve a lso seen that the invariant information of Brukner and Zeilinger, which I presented 
in section [2j| can be used to obtain upper bounds for the efficiency of a class of QRAC's. By 
building a QRAC with three-level systems, or qutrits, we have seen that those bounds are not 
tight in general, despite being tight for the QRAC's I presented using a qubit. 

Then I presented an analysis of experimental conditions on detector efficiency and back- 
ground counts, which could enable a better-than-classical protocol. From a simple analysis of 
the quantum and the optimal classical protocol I showed that these conditions are equivalent to 
those required for a CHSH non-locality test, except that locality conditions are not necessary. 
By relaxing this requirement the protocol becomes equivalent to a quantum contextuality test, 
and is amenable to implementation using current ion trap technology. 



Chapter 8 



On multi-party quantum 
communication complexity 

In this chapter I will analyse different ways of solving a simple multi-party communication 
complexity problem first proposed by Buhrman and co-authors in [35, 38]. As we have seen in 



section kLlL quantum mechanics can help in reducing the communication complexity of certain 



problems. In the context of the problem we will analyse, references [35, £3] showed that multi- 
party entanglement can reduce the required amount of communication for a flawless protocol. 
In this chapter I approach the problem somewhat differently, in a way that allows a fairer 
comparison with the classical protocols. For that, I will restrain the amount of communication 
allowed and explicitly compute the probability of success for both the classical and quantum 
protocols. This will enable us to consider realistic implementations, involving limited detection 
efficiencies and non-zero noise. 

We will see that the entanglement-based quantum protocol is better than classical protocols 
due to multipartite non-locality. Accordingly, the protocol can be interpreted also as a multi- 
particle non-locality test. I will find a set of sufficient conditions on detection efficiency and 
noise for such tests, directly from the quantum protocol. 

Remarkably, the quantum protocol involving qubit-communication can be demonstrated with 
realistically low detection efficiencies. This is a practically unique situation in quantum infor- 
mation experiments, which usually have low detection efficiencies, which requires post-selection 
of experimental runs or a fair sampling assumption. This is simply not an option if we are inter- 
ested in practical quantum applications, as opposed to proof-of-principle demonstrations. We 
will see that the qubit-communication protocol can be shown to be better than any equivalent 
protocol using classical communication with no need for such artificial performance-enhancing 
procedures. 

This qubit-communication solution to the communication complexity problem suggests that 
entanglement is not the only non-classical resource giving quantum information processing its 
power. We will see that this quantum protocol relies on the continuous nature of the set of 
quantum pure states for its higher-than-classical performance. In the next chapter I will analyse 
this issue in more depth by discussing the quantum solution to a similar problem. We will see 
that the quantum protocols tell us many things about the foundations of quantum theory and 
computation capabilities of quantum systems. 

This chapter is organised as follows. After a review of the communication complexity task 
definition (section |8.1[), I will discuss different ways to compare quantum and classical protocols, 



and the reasons for the particular choice I make. In section 8.2 I determine the optimal classical 
probability of success for the task, for a fixed amount of classical communication. Then in section 
.31 1 present quantum protocols that solve the task ideally with unit probability, resorting either 



to entanglement or to qubit communication. In section 8.4 I discuss how these protocols are 



affected by experimental limitations, and I conclude with a chapter summary in section 
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8.1 A multi-party communication complexity task 

In this section I review the Modulo-4 Sum communication complexity problem defined for three 
parties by Buhrman, Cleve and van Dam |35| , and later generalised to N parties (N > 3) in 
|j38| . An updated version of |]35| was published in p6[| . 

The problem can be stated as follows. Each party Pi receives a two-bit number input Xi, 
subject to the constraint: 




The strings are chosen randomly with a uniform probability distribution among those combina- 
tions that satisfy eq. ( pTl] ) above. After possibly some distributed computation, involving some 
communication between the parties, one of them (say the last one Ppj) must announce the value 
of the Boolean function 

(8.2) 

In other words, each party is given a number Xi £ {0, 1, 2, 3}, subject to the constraint that the 
sum of all Xi is even. After some communication the last party must decide whether the sum 
modulo-4 is equal to or 2. 

The type (and amount) of communication allowed will be important when we compare 
quantum and classical protocols. In the next section I present two different sets of constraints 
on the communication allowed, and the reasons that motivated me to consider the differences 
between quantum and classical in these two settings. 

8.1.1 Restraining the amount of communication 

There are different approaches one can take when studying this problem. The original papers 
] proved bounds on the amount of communication necessary for solving this problem 
classically in the public communication setting, where each party broadcasts to all others. Then 
an entanglement-based quantum communication complexity protocol was shown to be possible, 
which reduced the necessary amount of communication for a perfect protocol. 

Instead of calculating the minimal amount of communication sufficient to solve the task, 
another thing one might want to do is to limit the amount of communication, and then find the 
optimal probability of success achieved by quantum and classical protocols. This is the approach 
I adopt here, for the reasons I sketch below. 

First, it could be the case that exact protocols exhibit a large gap between quantum and clas- 
sical, which however disappears when we consider more realistic, probabilistic protocols. Directly 
comparing the probability of success of general probabilistic classical protocols with quantum 
protocols (which are sometimes intrinsically probabilistic) seems to be a fairer comparison. 

A second motivation for this approach are experiments: knowing the optimal probability 
of success of ideal quantum and classical protocols, we will be able to determine sufficient 
experimental requirements for implementing these protocols in practice. Factors such as noise 
and finite quantum detector efficiency tend to lower the efficiency of a realistic quantum protocol, 
and these can be easily included in my analysis. I will derive sufficient conditions on these factors 
for practical quantum protocols that are still better than the optimal classical ones. 

Still keeping an eye on feasible experimental implementations, in what comes below we will 
require the communication to be sequential. This decision is related to the fact that the sequen- 
tial quantum communication necessary to solve this problem can be conveniently realized by 
sending a single photon through a series of optical elements representing the parties. We will 
soon see that such quantum communication can outperform the equivalent classical communi- 
cation setting even with realistically low quantum detection efficiencies. 

In common with the two-party communication complexity problem I described in the last 
chapter, the quantum protocols I will consider here are of two types, involving either using 
quantum entanglement or using quantum communication instead of classical communication. 
For each type of communication I will compare the quantum protocol with an equivalent classical 




mod 4 



38 



CHAPTER 8. ON MULTI-PARTY QUANTUM COMMUNICATION COMPLEXITY 71 



protocol, and find the conditions under which the quantum protocol presents an advantage. For 
the reasons I explained in the last paragraphs, I will impose two different restrictions on the 
amount of communication allowed: 

A- Entanglement-based protocol. Here we will compare quantum and classical perfor- 
mances when each party Pj is allowed to send just a single bit of classical communication to the 
last party P/v ■ We compare the optimal classical protocol with the entanglement-based quantum 
protocol which supplements this communication with measurements on multi-party entangled 
states. 

B- Sequential qubit communication protocol. Here the allowed one (qu)bit of commu- 
nication must be sequential: party Pi sends one (qu)bit to party P2, who sends one (qu)bit to 
party P3 and so forth, until the last party P/v who has to announce the value of f(~x). The com- 
parison between quantum and classical is made by comparing the probability of success of the 
optimal one-bit message classical protocol with that of a one-qubit message quantum protocol. 

The entanglement-based protocol can be interpreted as a non-locality test for three or more 
entangled qubits. Accordingly, the sufficient experimental conditions I will derive will also be 
sufficient for genuine multi-particle non-locality tests. This represents a simple example in 
which a quantum information application brings results which are relevant to foundational tests 
of quantum theory. 

We will see that the qubit-communication protocol derives its higher-than-classical proba- 
bility of success from another quantum characteristic. I will briefly discuss this at the end of 
section p. 4. 2 , and at considerably more length in the next chapter. The suggestion of a qubit- 



communication protocol for the tripartite Modulo-4 Sum problem was made independently also 



by Buhrman, Cleve and van Dam [36], in an update of the earlier paper 



8.2 Determining the optimal classical protocols 
8.2.1 Sequential communication 

First, let us obtain the optimal classical success rate for the Modulo-4 Sum problem, when we 
allow only one bit of sequential communication from one party to the next. 

For the moment let us consider only deterministic protocols. The first party Pi has access 
only to her two-bit string x%, and so can choose between 2 4 protocols. These can be represented 
by the four-bit string prot\, whose n th (n = 0, 1,2,3) bit encodes the message mi to be sent 
to P2 if x\ = n. The other parties Pj (j = 2, ...,N — 1) can choose among 2 s protocols that 
take into consideration both Xj and the message rrij-i received from the previous party. Each 
of these protocols can be represented by an 8-bit string protj, whose n th (n = 0,1,..., 7) bit 
encodes the message to be sent when 2xj + rnj-x = n. 

Each possible deterministic protocol can then be represented by the (N — l)-tuple 



prot = (proti,prot2, ...,protN_i). 



Finding the probability of success of a given protocol prot is a straightforward computation. It 
must be done in parts, each part corresponding to one of the possible values for the two-bit string 
xn belonging to the last party. Starting with xjy = 0, we produce a list of all possible input data 
{xi, X2, xjy-i} compatible with xn = 0. Then we compute the messages m^-i corresponding 
to each (according to protocol prot), and find the fraction of cases in which P/v's most likely 
guess about the value of function / would in fact be correct. This is repeated for xn = 1,2 
and 3, and the results averaged to obtain the overall probability of success p c . The optimal 
deterministic protocol can then be found by a computer search over all 2 4 (2 8 ) 7V " 2 = 2( 87V - 12 ) 
protocols. 

I wrote a C computer program that finds the optimal classical probability of success for 
A = 3,4, 5 and 6. Moreover, a limited search over protocols for larger number of parties yields 



some lower bounds for p c . The results are summarised in table 8.1. 
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Table 8.1: Optimal p c for N parties and sequential communication 



N 


3 


4 


5 


6 


7 


8 


Pc 


3/4 


3/4 


5/8 


5/8 


> 9/16 


> 9/16 



Table 8.2: Optimal p c for N parties and non-sequential communication 



N 


3 


4 


5 


6 


7 


Pc 


3/4 


3/4 


5/8 


5/8 


9/16 



The optimal p c for ./V = 3, 4, 5 and 6 is attained by many protocols, for example the one 
consisting of prot\ = 0011 and all the other protj = 01011010. The same protocol yields the 
lower bounds for the optimal probabilities of success presented above for N = 7 and 8. Checking 
that these lower bounds are tight would involve a very long exhaustive search over all protocols. 
For the purpose of comparison with the quantum protocol given below, it would be desirable to 
obtain at least an analytical upper bound for p 1 ^ that decreases with N . A rigorous proof has 
escaped me, but in the next section I make a conjecture about p c for general N. 

Up to now we have been computing the probability of success for deterministic protocols. 
In a probabilistic protocol, each party Pj implements his/her own protocol by probabilistically 
picking a deterministic protocol protj from some set of protocols, according to probabilities 
obtained from a list of random numbers. Since this list of numbers could have been shared 
beforehand between the parties, the last party P/v can know exactly which protocols were chosen 
by each of the other parties for each run of the probabilistic protocol. This means that each 
run of the probabilistic protocol is effectively a deterministic one, with a probability of success 
bounded by the optimal deterministic p c derived above. The relation between deterministic and 
probabilistic protocols for communi catio n complexity tasks is further discussed in chapter 3 of 
the book by Kushilevitz and Nisan [126]. 



8.2.2 Non-sequential communication 

Here we discuss how to obtain the optimal probabilities of success for classical protocols with 
one bit of communication from each of the first (N — 1) parties to the last party P/y. As has 
been done a couple of times already in this thesis, the optimal protocols are obtained through 
an exhaustive search over all deterministic iV-party protocols. 

How many protocols are there? The two-bit string X{ belonging to party Pj can only assume 
four values. Each party's protocol can involve a distinct binary choice of message for each of 
these values; there are therefore 2 4 = 16 possible protocols for each party. Since a total of 
(N — 1) parties enter in the communication protocol, the total number of protocols is 2 4N ~ 4 . 

Again, we see that the number of protocols increases exponentially. Despite that, an exhaus- 
tive search is feasible for small N. I wrote a computer program in C that computes the optimal 
probability of success for = 3 to 7 parties. The results are summarised in table ^2. 

As we see, the task gets harder and harder as the number of parties increases, as happened 
in the sequential communication model. At least for small number of parties, the optimal 
probabilities of success are the same for the two communication models. 

Based on the structure of the problem and on the values for p c calculated above and in the 
previous section, I conjecture that the optimal classical probability of success is given by: 



Conjecture: p c = - + 

2 i^r- 



(8.3) 



for odd N, being the same for both models of computation (sequential and non-sequential). This 
would approach the random guess success (p = 1/2) exponentially with the number of parties 
N. 
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8.3 Quantum protocols 

8.3.1 Entanglement-based quantum protocol 

In this section I will review the entanglement-based protocol for this task, first presented by 
Buhrman, Cleve and van Dam in [35], and further elaborated in j}^]. My presentation here 
will stress the fact that it is a quantum (non-local) phase that enables a higher-than-classical 
performance for this task. This suggested the use of the phase in a single qubit for the quantum 
communication protocol we will discuss in the next section. 

The quantum resource used will be a iV-party highly entangled GHZ state: 

|GHZ(0)> = i= (|0i0 2 • • • N ) + e^lhh ■ ■ ■ l N )) , (8-4) 

where <p is a phase which we will use in the protocol. 

To simplify the discussion, let us present the entanglement-based protocol for three parties 
only. The results generalise in a trivial way for more parties. 

The parties initially share a three-party |GHZ(0)) state. The quantum protocol starts by 
having each party P, apply the following unitary to his/her qubit: 

U: { |1> i°exp(!^)|l> ' (8 - 5) 



which effectively just rotates the phase <fr in the GHZ state (|8.4|) by an amount proportional to 
Pi's number X{. 

After the three parties have performed their rotations, because of the promise ( |8.1| ) the initial 
state will be transformed to: 

|GHZ(0)> - { ^SVJf^^T °r (8-6) 
[ |GHZ(tt)) if f(XA,x B ,xc) = I- 

It turns out that these two orthogonal states can be distinguished [and f(x) computed] with 
local operations and one bit of classical communication from Alice to Clare, and one from Bob 
to Clare. The local operation needed at each party consists of applying a Hadamard gate 

H .{ 10) (10) + ID) 

H ' \ |i>-^(|o>-|i» ' (8 - 7) 

followed by a measurement on the computational basis. It is easy to check that the three 
Hadamard gates transform state (ET9) as: 



|GHZ(0)) -» [000) + [Oil) + [101) + [110) , (8.8) 
|GHZ(vr)) -> [001) + [010) + [100) + [111) . (8.9) 

Alice and Bob then measure their qubits in the computational basis, and send the measurement 
outcomes to Clare. It is clear from the two equations above that Clare can distinguish perfectly 
between |GHZ(0)) and |GHZ(7r)), solving the task with certainty. 

The above protocol for thre e pa rties generalises for iV parties in a straightforward way. Each 
party P{ performs the unitary (|8.5| ) depending on his/her data Xi, followed by a Hadamard gate 
and a measurement on the computational basis. Each of the first (N— 1) parties then broadcasts 
the result to the last party Pjv, who can then compute file) unambiguously. 
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8.3.1.1 Relation with multi-party non-locality and contextuality 

The communication complexity task was devised explicitly to take advantage from the strong 
quantum correlations present in the |GHZ) state. Th ese correlations allow for the inequality- free 
proof of quantum non- locality proposed in [p2| , |145fl . The higher probability of success of the 
quantum protocol with respect to the optimal classical protocol relies heavily on this form of 
non-locality. 

In the discussion above I obtained the optimal probability of success for classical protocols, 
when the information is restricted to one bit from each party. This had not been done in previous 
discussions of this communication complexity pro blem , and enables us to directly compare the 
quantum and classical performances. In section |8.4| I will do this, taking into account finite 
detection efficiencies and imperfect state preparation. If, even with these imperfections, one can 
implement a better-than-classical quantum protocol with space-like separated measurements, 
this would constitute perfectly valid experimental evidence of multi-particle non-locality. This 
is because our calculation of the optimal classical probability of success imposes a constraint on 
any local hidden-variable theory: it must not enable a protocol that is better than the optimal 
classical one. Any protocol with space-like separated measurements that performs better than 
the optimal classical protocol must rely on non-local correlations inconsistent with local hidden- 
variable theories. 

In a more realistic setting, the measurements in a practical implementation would not be 
space-like separated. In this case we cannot claim that the measurements contradict the locality 
assumption; they do, however, contradict the non-contextuality assumption. Again we see that 
contextuality counts as a non-classical resource to be explored in quantum information protocols, 



as I have discussed in section 7.1 



This approach stresses the non-classicality of quantum information processing protocols as a 
way to investigate multipartite quantum non-locality and contextuality. It is interesting to see 
that simple arguments and computations allow us to obtain upper bounds to what is possible 
classically, and show that quantum mechanics violates these bounds even with mixed states, 
or with inefficient detectors. While Bell- type bipartite non- locality inequalities are simple and 
compelling, they get unsatisfyingly cumbersome when we increase the number of parties, or the 
number of settings at each party. My approach here also has its conceptual attractiveness, as it 
links non-locality and contextuality to an enhanced computational capability. Moreover, when 
we try to quantify multipartite entanglement of high-dimensional systems, this approach can 
be more fruitful and intuitively satisfying than deriving complex Bell-type inequalities which do 
not suggest any practical applications to the non-locality they uncover. 



8.3.2 Sequential qubit communication protocol 



In section 8.2. 1| we have seen that the Modulo-4 Sum problem gets harder and harder to solve 



classically with sequential communication, as the number of parties increases. There is, however, 
a simple quantum protocol with sequential qubit communication that has a probability of success 
p q = 1 independently of the number of parties involved. We will see that this protocol is a simple 



adaptation of the entanglement-based protocol presented in [38] and reviewed above. 
The idea is to start with the qubit in state 

|^) = i=(|0) + |l)) (8.10) 

and send it flying by all the parties, from first to last. Each party needs only act upon the qubit 
with unitary U given by eq. fl3,5p. After going through the N unitaries the qubit state will be 



l^) = -L(|o) + (-i)^)|i)) 



due to the constraint (|9.2[) on the possible inputs Xj. The last party can then measure \t/jf) in 
the {-^(|0) + |1)), t^(|0) ~~ I 1 )} basis, obtaining f(~x) with probability p q = 1. 
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Table 8.3: Conditions for better-than-classical performance 



N 






favourable /j x 77 region 


3 


2~s ~ 0.794 


~ 0.791 


M > i (V - 12r/ 2 + 12r?) 5 


4 


2~3 ~ 0.841 


~ 0.841 


3 1 

H > (-T] 4 + At] 3 - 6t] 2 + 477) 4 


5 


2~§ ~ 0.758 


~ 0.758 


3 1 
/i > ^2s (r/ 5 - 5r/ 4 + lOr? 3 - Wt] 2 + 5r?) 5 


6 


2~s ~ 0.794 


~ 0.794 


/i > ^2§ (-?7 6 + 6r/ 5 - 15r/ 4 + 20r/ 3 - 15r/ 2 + 677) « 


7 


2~f ~ 0.743 


~ 0.743 


A* > i 2 ^ I 7 ? 7 ~ 7r ? 6 + 21r ? 5 ~ 35r ? 4 + 35r ? 3 ~ 21r l 2 + 7r ?) * 



The protocol above i s an a daptation of the entanglement-based protocol presented in [38] 
(and reviewed in section 8.3.1) to the qubit-communication setting. While the entanglement- 
based protocol uses a non-local quantum phase which can be operated on locally by each party, 
the qubit communication protocol above uses a quantum phase of a single qubit to acquire 
information on f(~x) as it flies by the parties towards the last party Pjy, where a single detection 
reveals the result. The fact that this protocol involves only one detection (as opposed to the TV" 
detections for the entanglement-based protocol) leads to an experimental implementation which 
is robust against low detection efficiencies, as we will see in section |S.4.2 



8.4 Experimental feasibility 

8.4.1 Feasibility of entanglement-based protocol 

Creating entangled states of more than two qubits is experimentally much more challenging 
than creating bipartite states. Having said that, let us assume, for a moment, that the technical 
difficulties of creating GHZ states have been surpassed. In this section we will see that, in 
principle, the multi-party quantum protocols require lower detection efficiency [] 77 and tolerates 
higher background noise (1 — fi) than the two-party protocol of section [O]. As we discussed 
in section p. 3.1 , these results translate also as sufficient conditions for experimental violation of 



multi-particle non-locality. 

The analysis is similar to that presented in section |7.6| , with the difference that here we have 
N detections instead of just two. Whenever we detect a valid signal from all the TV" detectors 
(which happens with probability rj N fi N ), the quantum protocol works with probability p„ = 1. 
If the last party's detector does not click, she will rely on the possibility of the other parties' 
detectors having failed as well, and on them using the optimal classical protocol; this will happen 
with probability (l — rj) N and result in a probability of success p c given by table |8.2| . In all other 
cases, the last party will only be able to make random guesses, succeeding with probability 1/2 
only. Assuming independent errors at each detector, for a higher-than-classical probability of 
success we need 

V N » N p q + (1 - r,) N p c + [1 - W " (1 " V) N ]\ > Pc- (8.H) 
Substituting p c from table |8.2| into the inequality above we obtain the results summarised in 



table 8.3, for number of parties iV = 3 to 7. 



Table 8.3 shows the minimum 77 (when [i = 1), the minimum /x (when i] = 1), and the 
compromises between these two factors which still result in a quantum protocol with higher- 
than-classica l pr obability of success. 

In figure |8.1| I plotted the favourable 77 x \x region for iV = 3, 5 and 7. Note that the region 
monotonically increases for increasing odd N. Also, the region is larger than that required for 
the two-party QRAC we investigated in section [7l] [see inequality ( [7.20Q ] . This is related to the 
fact that quantum non-locality tests for multi-particle states can be done with lower detector 



efficiencies than for the two-particle case (see for example [131, O 



1 For the definitions of detection efficiency 77 and background noise (1 — fx) see section 7.6.1 
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Figure 8.1: Here we plot the lines delimiting the region where the background level (1 — fi) 
and detector efficiency r] allow for a iV-party entanglement-based quantum communication com- 
plexity protocol which is more efficient than the classical counterpart. The favourable regions 
correspond to the area above the lines indicated for N = 3, 5 and 7. Analytical expressions for 



these favourable regions are given in table 8.3. 



As we discussed in section 8.3. 1| , the requirements in table |8.3| also represent sufficient con- 



ditions for a multi-particle non-locality test with N = 3 to 7 qubits. Besides limited detector 
efficiency, here I also take into account limited visibility (due to background counts); this possi- 
bility has been addressed with respect to the particular experiment described in pl| in references 
[ ]60| and |179| ]. As far as I know, it is the first time simultaneous bounds both on detector effi- 
ciency and noise rate are obtained for more than three parties. 

This quantum protocol can also be used to obtain bounds on the amount of mixedness we 
can add to entangled multi-particle states and still obtain an entangled density matrix. This has 
been used as a measure of 'strength' of locality violation for large-dimensional states in p4| , W% , 
even though there has been some criticism to this criterion To see how to apply this idea, 
imagine that instead of performing the protocol with a perfect |GHZ(0)) [see equation (§OJ)], we 
instead use a mixture of it with the separable, maximally mixed state: 

p = e |GHZ)(GHZ| + (l-e)^, (8.12) 

where (1 — e) is the amount of mixedness added. The mixed state thus obtained is sometimes 
referred to as a Werner state. For large enough e it is clear (from continuity) that the quantum 
protocol should still go through with better-than-classical performance, but there must be an 
intermediary value for which this is not true anymore. This can be worked out in a straightfor- 
ward way using an argument like the one I used above for detection efficiencies and noise rates. 
For a better-than-classical performance we need: 

e+(l-e)->p c =*e>2p c -l. (8.13) 



With the conjectured p c from equation ( |8.3|) , the equation above would lead to the requirement 

tllat N-l , 

e > 2 — (8.14) 

for odd N. This would imply that the amount of mixedness that multi-qubit entangled states 
can tolerate and still remain entangled increases exponentially with the number of qubits. That 
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this is indeed the case has been shown through other methods by a number of authors, see for 
example 

8.4.2 Feasibility of qubit communication protocol 

In this section we will work out the detector efficiencies and noise rates which allow for a qubit- 
communication protocol which is better than its optimal classical counterpart. We will see that 
the sufficient detector efficiency decreases monotonically with the number of parties. This means 
that a better-than-classical quantum protocol can be implemented in a feasible experiment. 

For the moment, let us assume that the only limitation in implementing the protocol is 
rj < 1 (we will deal with the more realistic case below). In case of a successful detection (which 
occurs with probability rj) the probability of success is equal to one. In case the detection 
fails (probability 1 — rj), the last party Pjy has to make a random guess about the value of F, 
succeeding only with probability 1/2. Thus for a higher-than-classical performance we need to 
implement the quantum protocol with a detection efficiency rj such that 

r] + (l-rj)->p c . (8.15) 

Thus, it is sufficient to have rj > 2p c — 1. In section |8.2.1| we have seen that the optimal classical 
protocol for N = 5 parties has a success rate p^ =ii = 5/8, and therefore can be beaten by 
the quantum protocol if the detection efficiency rj > 0.25, in the absence of other experimental 
losses. 

For a more realistic grasp of the experimental difficulties, let us examine a simple quantum 
optical setup that implements the quantum protocol for this problem. The flying qubit is encoded 
in the polarisation state of a single photon. For a fair comparison with the classical protocol, it 
is important to allow only a single photon per run to pass by the parties and arrive at Pj\r. One 
way to achieve this is to use a parametric down conversion crystal pumped by a laser. Detection 
of one of the twin photons generated can then be used as a trigger to let the second photon 
go towards the parties. For the triggering mechanism to work we need to introduce a delay for 
the second photon, which can be easily achieved by coupling it to a few meters of optical fibre. 
Upon detection of the first photon, the second photon is allowed to come through the N parties. 
Each party consists of an optical element using birefringent materials to perform the unitary 
phase shift given by eq. ([T^). In the end, the last party Pjy must also detect the photon in the 
proper basis. 

Such a setup has other imperfections that must be considered, besides the limited detection 
efficiency rj. The first is the finite transmissivity t of the combination of N birefringent plates 
used to perform the unitaries Uj. Another problem is the fraction (1 — /x) of detected events 
which are due to detector dark counts. Finally, even if the detected photon is a signal photon, 
the success rate s of the quantum protocol can be less than perfect, because of imperfections and 
misalignment of the optical elements that produce the initial state, introduce the phase shifts 
and measure the final polarisation. Taking all these limitations into account, for a higher-than- 
classical probability of success we would need: 



p e / f = y,rjts + [1 - fJtrit]- >p c - (8.16) 



Now let us make some realistic estimates for these parameters for the protocol with N = 5 
parties. By using quartz plates with anti-reflection coating, it is possible to obtain transmission 
of a fraction 0.995 of the incident photons per plate, which in the case of five parties would 
result in t = (0.9 95) 5 ~ 0.975. It is relatively straightforward to bring dark count rates below 
the 1% level [|125|1 , so let us take \i = 0.99. Good alignment of the optical elements should enable 
a success rate of s — 0.90 whenever a signal photon is detected; for ex ampl e, visibilities of up to 
96% in simple Bell tests using entangled photons have b een r eported |125f| . 

Plugging these estimates for t,n and s in inequality ( $.16 ), we see that in order to obtain a 



better-than-classical probability of success i t is s ufficient to have a detection efficiency rj > 0.33, 



which is within reach of current technology [125]. It is reasonable to conjecture that the optimal 



pj; continues to decrease for N > 7, in which case the sufficient detection efficiency could be 
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dramatically lower. In principle, one way to calculate p^f for N > 7 is through an exhaustive 
search over all deterministic protocols, as was done in section 3.2.1 for N = 3, 4 and 5. 

It is clear that essentially the same setup can be used to solve the Modulo-4 Sum problem 
using classical polarised light. In common with a qubit, classical light has a continuous variable 
(the phase) that can be manipulated, as opposed to classical bits that can only assume two 
discrete values. The counter-intuitive quantum feature that helps in communication complex- 
ity is the fact that even single photons still retain the continuous description of the classical 
electromagnetic field. More generally, a cf-dimensional pure quantum state is characterised by 
2(d — 1) real parameters that can be used for communication purposes, as opposed to the d 
discrete states available to a classical system of same dimensionality. Defining exactly for which 
communication tasks such a different resource can be used to advantage is a central research 
problem in quantum information theory. 

In the next chapter I will present a task in which the contrast between quantum and classical 
information is even more pronounced. We will see that the continuous nature of the set of 
quantum pure states enables a single qubit to encode more information that any finite number 
of classical bits. 



8.5 Chapter summary 

In this chapter I have analysed a simple multi-party communication complexity task, and dif- 
ferent quantum protocols to solve it for the cases of iV = 3 to 7 parties. Simple arguments and 
an exhaustive computer search enabled me to obtain the probability of success of the optimal 
classical protocol for two different situations: when the parties communicate directly with the 
last one, and when they communicate sequentially to one another until the last. 

In the first situation, we have seen that multi-party quantum entanglement improves the 
probability of success. I also found the detector efficiency and tolerable noise rates which are 
sufficient for the protocol to work with better-than-classical performance. These requirements 
are equivalent to those for a multi-qubit non-locality test, and were shown to be less stringent 
than those needed for a bipartite CHSH non-locality test. 

In the second situation I compared the optimal classical protocol with just one bit of sequen- 
tial communication with the quantum protocol in which a single qubit is communicated from 
party to party through a quantum channel. I showed that the required detection efficiency also 
decreases with the number of parties, and allows for a realistic quantum optical setup with a 
feasible detection efficiency rj = 0.33 to beat the optimal classical protocol. This is a unique 
case among quantum information applications, where the quantum performance is better than 
classical even in realistic experimental conditions. The higher-than-classical performance of the 
quantum communication protocol arises directly from the use of a quantum phase to encode 
information. 

If implemented, this would be the first experiment to demonstrate the superiority of quantum 
communication over classical communication for distributed computation tasks. 

In the next chapter we will analyse a similar information processing task, and show how 
a single qubit can be more efficient that an arbitrarily large number of classical bits in some 
situations. We will also see other motivations for implementing the feasible qubit-communication 
protocol discussed here. 



Chapter 9 

Encoding information in a single 
qubit 

A general pure state of a two-level quantum system - a qubit - can be written as 

|^)=a|0)+6|l), (9.1) 

where a and b are complex numbers satisfying \a\ 2 + \b\ 2 = 1. Given 

that the overall complex phase has no physical significance, we can describe state ([O]) with 
two real numbers. This means that the full specification of a single qubit's quantum state in 
general requires an infinite number of bits of information, necessary to specify these real numbers 
with arbitrary accuracy. 

This is in marked contrast with a classical two-level system, whose complete information 
content can be encoded in a single classical bit. Given this fundamental difference between a 
bit and a qubit, it is natural to ask whether we can use a qubit to communicate an arbitrarily 
large amount of classical information. 

Of course, it turns out that a single qubit of communication can at most be used to send 
one bit of information between two previously unentangled parties, Alice and Bob. This is the 



essence of Holevo's theorem [106], which was further refined by Nielsen [ |149| , p3| . Even in the 
case where Alice and Bob are allowed to use some previously shared entangled state, a single 
qubit of communication can result in a transmission of only two classical bits (using quantum 
dense coding jl^] ) . This is still a long way from the arbitrarily large amount of information we 
might have hoped for. 

The fact that we cannot use a qubit to communicate an unlimited amount of information 
raises doubts about the reality (i.e. objective existence) of the infinite amount of information 
encoded in a single qubit state. In this chapter I will show how the real nature of the parameters 
in eq. ( |9.1| ) can be used in an information processing application. The trick is not to require that 
this information can be actually read out, but rather to use it to substitute for an unlimited 
amount of classical communication during a distributed computation task. I illustrate this 
with a very simple information processing task requiring an arbitrarily large amount of classical 
communicatio n, bu t which can be done perfectly using a single qubit instead. 



In section [4.1.1 we have seen that quantum communication complexity results have already 



established that quantum communication can be more efficient than classical communication 
for many distributed computation problems. The task I discuss in this chapter was inspired 



by the Sum Modulo-4 communication complexity problem originally proposed in [35, and 
discussed in chapter ||. In a way, the task we discuss in this chapter is a continuous version 
of the qubit-communication protocol presented in chapter ||. This generalisation enables us to 
prove an unbounded separation in communication power between a qubit and any finite number 



of bits. In sections |9.4| - 19.6| we explore a number of consequences of this result, relating to issues 
in computational complexity theory and to the foundations of quantum theory itself. 

From a computational perspective, in section |9.5| I will show the relevance of this task in 
investigating the trade-offs between different types of resources (such as memory, time, accuracy 
in operations) available in the quantum and classical models of computation. We will see that in 
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Figure 9.1: We are allowed to take system S from points A to B in the x-axis, coupling it in any 
local way with the field <p(x). The task is to obtain one bit of information about (p(x)dx, as 
described in the main text. 



some instances the quantum advantage arises from certain characteristics quantum computing 
shares with analog computing, and not from quantum entanglement, for example. 

From a more foundational point of view, in section |9.4] I will argue that the unbounded 
quantum advantage can be traced back to a crucial axiom that different iates between classical 
probability theory and quantum theory, as proposed by Hardy |100| , 101 1 . It also implies some 
constraints on hidden-variable theories capable of reproducing quantum-mechanical predictions 
for a single qubit. 

A third way to interpret this res ult s heds light on an altogether different quantum resource: 
quantum entanglement. In section 9J3 I will show that no finite amount of one-way classical 
communication can perfectly simulate the communication advantage provided by entanglement. 



9.1 The task 

Let us now define an information processing task, whose classical and quantum solutions we will 
analyse in the next sections. 

We have a one dimensional real field (p(x) defined on the straight line between points A and 
B in the x-axis. The integrated value of the field is guaranteed to be equal to an unknown 
integer m, times a known real constant, a: 

B 

<p(x)dx = ma. (9-2) 

L 

The task is simply to find out whether m is odd or even by sending some physical system 
S directly from point A to point B (it is not allowed to move bac kwards). We are allowed to 
couple the system in any local way with the field (p (see figure |9.1|) , and we are searching for a 
deterministic protocol that solves the task with certainty. 

In the next section I will present two simple protocols that solve this problem, one using 
a classical system S and one using a quantum S. The classical S we use is a system with an 
infinite number of distinguishable states, such as a classical pointer. Communication with such 
a system effectively represents the transmission of an unlimited amount of classical information. 
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The quantum protocol I will consider also solves the task, but using only a single qubit of 
quantum communication. After presenting these two contrasting protocols, in section 9^ I will 
show rigorously that this huge difference in communication resources holds between the quan tum 
pro tocol and any classical protocol. The consequences of this will be explored in sections 9.4 , 
and f 



9.2 Two simple solutions 
9.2.1 A quantum protocol 

Let the quantum system S be a spin-half particle initially prepared in the spin up (along the 
z direction) state. Now let us couple the qubit to the field so that the spin orientation will 
rotate in the y — z plane by an amount proportional to the strength of the field. This constant 
of proportionality can be chosen to be such that, by the time S reaches B, the spin has been 
rotated through m/2 full rotations. Thus, if m is even then the spin will be up, and if m is odd 
it will be down. The task can then be solved by a simple measurement of the spin along the z 
direction at B. 

This protocol can be realized experimentally in a simple way using the following quantum 

optical setup. We encode the classical field (p(x) in the magnetic field B x = B ■ x in the centre 
of a solenoid. In principle, arbitrary functions <f(x) can be generated by using a non-uniform 
current density; the field B x will be proportional to the local current density. 

In the centre of the solenoid we have a transparent rod. If a photon with vertical polarisation 
is sent through the rod, its polarisation vector at the other end will have been rotated by an 
amount proportional to the integrated value of the field (due to the Faraday effect). The con- 
stant of proportionality associated with the interaction can be altered, for example by changing 
the physical properties of the transparent rod. The polarisation can be analysed at B along 
the vertical-horizontal directions. If the constant of proportionality is correctly adjusted, the 
polarisation at B will be vertical in the case where m is even and horizontal if m is odd, solving 
the task. 



9.2.2 A classical protocol 

To solve the problem with a classical system, let S be a rod pivoted at one end and free to 
rotate in the y — z plane. The position of the rod is given by a real number 6 whose specification 
requires an infinite number of classical bits. In other words, the rod is a classical system with an 
infinite number of distinguishable states. From the computational point of view we will develop 



in section 9Jj , we will see that the rod can be thought of as a classical analog computer memory, 
or alternatively as an infinite digital memory. 

To solve the above information processing problem, we start with the rod in the up direction. 
We couple the rod S to the field so that as it moves through distance dx we rotate it by rnp{x)dx. 
By choosing the coupling constant r] = ir/a, the rod will be rotated by m/2 turns. At the end 
the rod will be pointing up if m is even and down if m is odd. 

At this point it is natural to ask whether any classical solution really requires a system 
S with an infinite number of distinguishable states. After all, the final position of the rod is 
giving us much more information about <p(x) than we actually need; we can actually read out 
the value of the integral of (f(x) modulo 2a (given by the rod's final position) and not just the 
answer to our yes/no question. This indicates that a more careful analysis is required before we 
conclude that it is impossible to solve the task with a classical S having only a finite number of 
distinguishable states. In the next section I will prove that this is indeed the case. 
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9.3 Necessity of unlimited amount of classical communication 

The classical protocol I presented above requires communication using a classical continuous- 
variable system with an infinite number of distinguishable states. By using such a system, we 
are effectively communicating an unlimited amount of information, encoded in the continuous 
parameter characterising the rod's position. In this section I will prove that the stated task 
cannot be solved by any classical protocol using only a finite amount of classical communication, 
encoded in a system with any finite number of distinguishable states. 

We start by dividing the interval AB into N equal sections, each of length 1/N, and label 
them sequentially from 1 to N, starting from A. Now define (j) n to be the integrated value of 
the field tp(x) over the nth section. Constraint ( |9,2j ) now reads: 

JV 

4> n = ma. (9-3) 

n=l 

We do not need to impose any restrictions on the nature of the function <p(x) except that it 
can be integrated. If we want <p{x) to correspond to a physical field it must be continuous, but 
that is not required for our proof to work. 

Even with continuous <p(x), there is no constraint on the numbers (f) n except that they sum 
to ma. Hence, we can find fields ip(x) which correspond to any point in the coordinate space 
{4> n } for n = 1 to N — 1, with 0^ being chosen so that eq. (|9.3|) holds. 

By breaking the problem in N parts, we effectively have N parties in line, who can communi- 
cate sequentially from the first to the last one, with the objective of enabling the last one to solve 
the problem. After this simplification we see the similarity with the sequential communication 
version of the communication complexity problem discussed in chapter || 

If this communication task can be solved for an arbitrary field <p(x), then it can be solved for 
any subset of points in the coordinate space {4>n}- Hence, we can further simplify by choosing a 
discrete set of values that the cp n can take, to enable us to prove the result in a simple way. We 
allow each (f> n to take only discrete values ak n /K, where k n £ {0, 1, 2K — 1}. We will also 
require the integer K to be a power of two, for a technical reason that will become apparent 
later. Note that 4> n takes values in the range to a(2 — 1/K). This makes sense as the sum is 
only important modulo 2a (since we are only interested in whether m is odd or even). 

From now on we will refer to the k n 's instead to the </> n 's. Condition fl9.3| ) becomes 

N 

J2 k n = mK. (9.4) 

n=l 

Again, since we are only interested in whether m is odd or even we will only be concerned with 
sums over the fc n 's modulo 2K. 

9.3.1 Notation 

Now, let us establish a notation that enables us to discuss all possible classical protocols in- 
volving communication with a finite-dimensional classical system S. Let L denote the number 
of distinguishable states of S. Communication with a classical system with L distinguishable 
states is the equivalent of sending a message of size log 2 L classical bits. 

Each party has a number k n (or equivalently <p n ). The nth party receives S from the 
(n — l)th party. This will be in a certain state l n —i S {1, 2, L}. Since the first party receives 
no information we will put Iq = 1 so we can still use this notation. 

The most general protocol the nth party can follow consists of selecting a function /" (k n ) 
which outputs a number l n £ {1, 2, L}. He then prepares system S in this state l n , sending it 
on to the (n + l)th party. The protocols adopted by all the parties must enable the last party 
(iVth) to observe system S in state Zjv-Ij an d use this together with his local information kjy to 
determine whether m is odd or even. 

In the next section I will show that for a perfect classical protocol it is necessary to have 
L > 2N — 1. Since N and K are arbitrary and can be chosen to be as large as we want, the 
number of bits encoded in system S must also be arbitrarily large. 
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9.3.2 Proof 

We start by choosing the number of distinguishable states L to be less than what is necessary to 
encode each party's number; so let us set L = 2K — 1. The first party sends message l\ = fo(k±) 
to the second party. However, this function cannot be one to one since L < 2K. Thus, for some 
l\, there must exist a ^ b such that /d(a) = /q (b) = l\. Because of that, when the second party 
receives the message l\ he does not know whether k\ = a or k\ = b. 

The nth party will try to enable the next one to learn the value of the partial sum [k\ + ki + 

• • • + k n ) mod 2K, as this is the data that the last party needs to know in order to solve the 
problem. However, we have just seen that after just one communication step, there are already 
two different values of the first party's data k\ assigned to a particular message l\. The idea of 
the proof is to show that this uncertainty increases with each communication step, until there 
exists a message In-i that leads the last party to error. 

Consider the nth party. Let A n be the set of distinct values of the partial sum (ki + /c2 + 

• • • + k n -i) mod 2K which are consistent with the message l n -\ he has received. In the case of 
the last party, we identify here a possible cause for a flawed protocol. If A/v-i has elements that 
differ by K, there will exist one value of k^ for which the last party will be unable to solve the 
problem with certainty. 

The nth party will send l n = /" 1 (k n ) to the (n + l)th party. For some l n there must exist 
distinct a and b such that l n = // l _ 1 (o) = fo) (since L < 2K). If the (n + l)th party receives 
this l n then 

\A n+1 \>\A n ®{a,b}\, (9.5) 

where \A\ denotes the number of elements of the set A and A © B is the set of all distinct sums, 
modulo 2K, of one element from A and one element from B. Already we see that the number 
of elements of \A n \ cannot decrease as n increases. 

Let us now prove that any successful protocol must have 

\A n ®{a,b}\ > K| + l , (9.6) 

which together with (|9.5D means that |^4 n +i| is strictly larger than \A n \. 

The pro of w ill be by contradiction. Let us thus assume that there is a successful protocol in 
which eq. ( |9.6D is false. This is equivalent to saying that \A n © {a, b}\ = \A n \, as \A n © {a,6}| 
cannot be less than \A n \. Note also that \A n \ = \ A n © {a}\ = \ A n © {b}\. From our assumption 
and from writing 

A n © {a, b} = (A n © {a}) U (A n © {&}), (9.7) 

it follows that A n © {a} = A n © {6} (if they were not the same, their union would have more 
elements than each of them separately). 

Since we are doing modulo arithmetic we can think of the members of the set A n as being 
arranged around a circle. Then the effect of the ©{a} operation is to rotate them all forward 
by a and similarly for the b case. Hence, A n © {a} = A n © {&} implies that A n = A n © {\b — a\}. 
Put 1 6 — a\ = A\. We can apply this shift as many times as we like. Hence, A n = A n © {«Ai} 
where i = 1, 2, • • • . It is possible that Ai divides 2K in which case it is a period of the set A n . If 
it is not period then we can still prove that A n must have a period. To see this note that either 
Ai is a period or there exists an integer i\ such that < iiAi mod 2K < A\. Put A2 = «iAi 
mod 2K. Now, A n = A n © {1A2} and hence either A2 is a period or there exists an integer 12 
such that < ^A2 mod 2K < A2. We can continue in this way generating a sequence of A^'s. 
At some point this sequence must terminate since Aj + i < Aj and Aj > 1. The last member of 
the sequence must be a period of A n . Let this period be v. 

As we have chosen K to be a power of two, this divisor v must be either equal to one, or to 
some power of two as well. These two possibilities entail the existence of elements of A n which 
differ by K, leading to an error. Since we assumed from the beginning that there was a successful 
protocol in which eq. (9.6) was false, to avoid a contradiction we are forced to conclude that eq. 



must be true. 

Thus, in any successful protocol eq. (|9.6|) must hold for all n < N. The last party then 
may receive a message for which | ^4 a^— 1 1 = N. If this number is larger than K then there must 
exist elements in .Ajv-i separated by K, leading to an error. So we need K > N. Now let us 
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remember that we fixed the number of distinguishable states in system S to be L = 2K — 1. 
This means that any flawless protocol requires that L > 2N — 1. As K and N can be chosen at 
will, this proves that any classical protocol for the task proposed requires a system S with an 
arbitrarily large number of distinguishable states. 



9.4 Why is quantum better? 

It is instructive to revisit the classical protocol and spot the reason why the rod could be 
replaced by a qubit. The rod's coupling with the field 4>{x) takes it through its continuous set 
of distinguishable states in such a way that its final state encodes the property of <j>(x) that we 
want to know. The key point to note is that the protocol works just as well if we n ever acquire 
information about the rod's state until the final point B. There, constraint ( |9.2D guarantees 
that acquiring a single bit of information about the rod's state is enough to solve the task. The 
rod offers us more than we actually need: it is enough to have just two distinguishable states for 
the read-out, plus the continuous set of pure (but non-distinguishable) states for the encoding. 
This is precisely what a qubit offers us. 

It is clear that the task could be solved by using a clas sical electromagnetic field instead 
of a single polarised photon, as we have discussed in section |9.2.1 . The protocol using a single 



photon, however, is as economical as possible, working perfectly with a communication consisting 
of the smallest possible physical system. Besides this practical point, we have to remember that 
a classical electromagnetic field, with its infinite number of distinguishable states, is only an 
approximation to the actual physical properties of the quantum electromagnetic field. 

By quantising the classical electromagnetic field, we obtain a more physically accurate de- 
scription of the system, and find out that the single building blocks (the quanta) retain a curious 
mix of characteristics. Unlike the classical field, they have only a finite number of distinguish- 
able states (hence the term 'quanta' to describe them and the whole theory). On the other 
hand, they somehow still retain the continuous classical description of the classical field they 
constitute, by having a continuous set of pure, but non-distinguishable, states. 

Interestingly, the existence of this continuous set of pure states turns out to be the distin- 
guishin g tr ait between quantum theory and classical probability theory. We have seen this in 



section |2.4| , where we briefly presented Hardy's axioms for quantum mechanics. 

In Hardy's approach, the crucial difference between quantum and classical physics is the 
continuous nature of the set of quantum pure states, as opposed to the discrete nature of the set 
of classical pure states. As we have seen, it is also this that allows for an unbounded separation 
in communication power between a qubit and any finite-dimensional classical system. 

We thus see that the unbounded separation we have noted is not simply a curious event 
arising from some peripheric characteristic of the single qubit case. It actually results from 
the unavoidable, fundamental trait that distinguishes quantum theory from classical probability 
theory. 

The result I have presented in this chapter places a significant constraint on any hidden- 
variable theory (HVT) describing a single qubit. It implies that the HVT must have at least one 
continuous parameter, capable of taking on an arbitrarily large number of values. While this is 
indeed the case for Bell's deterministic hidden- variable model for a single spin-half particle |l3| , 
my result shows that this is an unavoidable trait of any HVT consistent with quantum theory. 



9.5 A computational perspective 

In this section we will analyse the information processing task I proposed from a computational 
perspective. I will start by re-stating the bound on communication resources as a bound on 
the sizes of classical and quantum memories necessary for some computations. This is a result 
in computation space complexity, which can be motivated using the 'ROM-based' computation 
model of Travaglione et al. [181], which I also present in the next section. 



Then I will show that this quantum advantage in memory size comes at the expense of an 
exponentially increasing accuracy required for the quantum operations. Fortunately, we will see 
that this accuracy can be reached with polynomial resources at each party. 
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I will argue that this is an example of the more general problem of computational resource 
trade-offs between different models of computation (in this case, classical and quantum). 

In the remainder of the chapter we will often be analysing how different resources scale with 
the problem size. It is natural to take n = log 2 N as the problem size, where N is the number of 
parties in the discretized version of the communication protocol introduced earlier. While there 
are other ways to define problem size, for our purposes it is sufficient to stick to a consistent 
definition, as we will be only concerned with the asymptotic differences between the quantum 
and classical models. 



9.5.1 Time- versus space-complexity 

When computer scientists analyse the complexity of an algorithm, they usually put more empha- 
sis on the time complexity, which deals with the expected number of computational operations 
needed for performing an algorithm. It has been shown that quantum computing offers a de- 
crease in the time complexity of important algorithms: a quadratic speed-up for database search 



(Grover |93|]) and a conjectured exponential speed-up for integer factoring (Shor [174, 175]). 

Time, however, represents only one of the resources necessary for computing. Space com- 
plexity analyses how much writable memory is necessary to perform different algorithms, and 
how the size of this memory scales with problem size. This problem has been addressed in 



a few recent papers j|, 205, 181, 178], but in general it has drawn much less interest than 
time-complexity. 

In fl81| Travaglione et al. discussed what they called the 'ROM-based model' of computation, 
and were able to obtain a simple analytical result comparing the classical and the quantum 
space- complexity. Since this is important for understanding my result, let us have a look at 
their findings. 



9.5.1.1 ROM-based computation 

If we want to analyse space efficiency of computers, it is convenient to distinguish between 
that part of the memory that is used for the read-only data, and the part of the memory 
that is actually operated upon during the computation. This motivated Travaglione and co- 



workers [181] to define a deterministic, reversible model that enabled them to compare the 
space-efficiency of quantum and classical computers. 

Their model can be presented as follows. All the input data necessary for the computation 
is available as a Read-Only Memory (ROM) of j bits U1U2U3, ...,Uj. Besides this ROM, we have 
also m (qu)bits as writable (qu)bits, which can be operated upon one at a time by reversible 
gates conditionally on the value of one of the ROM bits. The gates are required to be reversible 
for a fair comparison between the quantum and the classical models, since the quantum memory 
necessarily undergoes a unitary (hence reversible) evolution. After many such computational 
steps, in the quantum case we require the quantum writable memory to be in a disentangled 
state, consisting of the chosen computational basis states. This results in a computation which 
is deterministic and reversible, in both cases, classical or quantum. 

A ROM-based computer is called universal if it is capable of transforming the m writable 
(qu)bits to any one of the 2 A (m2 :) ) possible boolean outputs of the j ROM-bits. Travaglione 
et al. proved that universal computers in this definition require only a single qubit of quantum 
memory, or two bits of classical memory. Their proof, however, does not state anything about 
the difference in time-complexity between the quantum and the classical models. 

Next we will examine the relation between this model of memory-bounded computation and 
the task we are studying in this chapter. We will see that our task corresponds to a specific 
ROM-based computation, in which we first constrain the number of times the ROM bits can 
be read, and then ask what the size of the minimal computer memory must be for a certain 
deterministic computation. 
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9.5.1.2 Task as a ROM-based computation 

We can view the discrete version of the protocol we have studied in this chapter as such a 
ROM-based computation. We start by defining a function g : m — > k, where m G {1,2, N} 
represents each of the parties and k £ {0, 1, ...,2K — 1} represents the numbers each party has. 
The constraint on the integral of (j>{x) translates as a constraint on the function: 



The Boolean function we want to evaluate is one which distinguishes between these two possi- 
bilities. 

The role of the ROM bits is played here by each party's data, which they can use to condi- 
tionally apply unitaries on the quantum register, or to change the state of the classical digital 
register. We have a 'ROM call' each time a party uses his data to operate on the quantum 
memory. 

The universality result for ROM-based computation means here that this task can be solved 
either with a classical memory of just two writable bits, or with a quantum memory of a single 
qubit. It is possible that there exists a huge gap in time-efficiency between these two space- 
efficient computations; however, as far as space complexity goes, the separation between classical 
and quantum in this case is very small. 

The problem we solved in this chapter imposes a further constraint: each party has access 
to the memory only once, before passing it to the next party in line. This corresponds to a 
constraint in time: the protocol must end after exactly N operations on the memory, one by 
each party. So here we are actually imposing constraints both on time- and space-efficiency. We 
start by fixing the allowed time-complexity (in number of ROM calls) to be N, and then we ask 
what the minimal memory size is, for classical or quantum memories. 

By fixing the time of the computation to be N function calls, we are effectively allowing 
the parties the exact minimum number of ROM calls necessary for the computation. This is 
true because a change in a single party's number can flip the overall sum from modulo IK to 
K modulo 2K, changing the result of the computation. This means that if we miss any single 
party's number, we could be led to error in the final result. Therefore, the minimal number of 
ROM calls necessary for this task is exactly one per party, or N in total. 

Now let us re- interpret our result: we have shown that with this minimal time complexity, 
any classical ROM-based computer needs a writable memory of at least log 2 (2iV — 1) bits. 
In terms of the problem size n = log 2 iV, the classical memory is 0(n), i.e. linear in n. A 
quantum computer, on the other hand, succeeds with a fixed memory of the smallest possible 
size, consisting of a single qubit. This is a striking separation between the capabilities of these 
two models of computation. 

Here we must add a word of caution about our time-complexity definition. As is common in 
such discussions, we adopted a time complexity which is measured by the number of function 
calls, sometimes called in the jargon oracle calls. This, however, can be misleading, as each oracle 
call can involve a complex quantum operation which requires a large number of elementary gates 
to be performed. 

For a more physically motivated definition of the time complexity, we need to actually address 
the complexity of implementing each oracle call. As each party has a classical number, our oracle 
already consists of a classical oracle in both the quantum and classical cases, which is good. But 
depending on the oracle result, each party has to implement a unitary on the qubit, whose 
complexity we must evaluate. 

In the next sections we tackle this problem. We will start by showing that the quantum 
algorithm with small error 5 requires an exponential accuracy for each party's unitary operation, 
which is bad news. Surprisingly, however, we will see that this exponential accuracy in each 
unitary can be reached with a number of gates which is only polynomial, which is good news. 

This will enable us to compare the quantum and classical computations in a fairer way. 
We will see that, with the same number of function evaluations, quantum computation offers 
a polynomial space advantage over classical computation, while requiring a polynomial time 
overhead. 



N 




(9.8) 
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9.5.2 On the accuracy needed 

Even in classical computational complexity theory, there are mor e resources to account for, 



besides time and space. For example, the work of Landauer [129] has shown that there is a 
lower bound on the energy consumption of irreversible computation. This motivated the study 
of classical reversible computation, which can be done essentially with no energy consumption 



[14|. In today's computers energy use is not a major concern, but this may change in future 
computer architectures. 

When we consider quantum computation, it is not yet completely clear which resources will 
prove important to consider, from a physical point of view. Suppose, for example, that in the 
next decades we obtain a better understanding of quantum theory, or maybe a more successful 
theory that supersedes it. It may be the case that such a theory imposes restrictions on the 
type of entanglement that can be physically produced, through decoherence, for example. This 
possibility is already being considered in different contexts, for example in Penrose's idea for 
gravitationally-induced decoherence |153| ]. Then it would become important to quantify how 
much of this 'difficult' entanglement is required by quantum algorithms, and count that as a 
computing resource to be minimised. 

It is already clear, however, that the accuracy with which we can implement single qubit 
gates is limited. Much work has been done to investigate what is the number of gates required 
to achieve the necessary accuracy for quantum unitaries during a quantum computation. For 
an introductory treatment of this problem and references, see Chapter 4 of the book by Nielsen 
and Chuang | 151 |. 



This issue is important in the context of the computational problem I have presented in 
this chapter. The first thing to note is that for an absolutely perfect quantum protocol, the 
accuracy needed is infinite, as any small inaccuracy in the implementation of a single unitary 
in general leads to some probability of error at the end. This, however, can be also said of the 
digital memory involved in the classical protocol, as its performance also assumes perfect circuit 
elements. 

What we can do from a theoretical point of view is to calculate the accuracy needed to 
implement each quantum unitary in our problem, so that the quantum protocol succeeds with 
high probability 1 — 5. We will see that the accuracy needed increases exponentially with the 
problem size n. 

The feasibility of quantum computation networks depends on some fundamental results. One 
of the first things we need to know is whether it is possible to compose many imperfect unitary 
operations, in such a way that the final probability of error does not increase too fast with the 
number of unitaries. Fortunately, it was shown (see section 4.5.3 of [p. 5 1|| ) that this is indeed 
the case. More precisely, in order to approximate iV successive unitaries with a final probability 
of error smaller than 5, it is enough to approximate each unitary with an error bounded by 
e < 5/{2N). In other words, the errors in the individual unitaries compose only linearly with 
the number of unitaries applied. 

If we go back to our problem, this means that each party must be able to implement his 
unitary with an error of at most e = 5/(2N). This may not seem too bad, but we must remember 
that the problem size is given by n = log 2 N, where N is the number of parties. This means that 
the accuracy necessary for each party's unitary operation increases exponentially with problem 
size. 



9.5.3 Achieving the accuracy 

It may come as a surprise that this exponential accuracy can be achieved with just polynomial 
resources at each party. In order to evaluate the resources needed, we need to assume that each 
party can implement a universal set of only two single-qubit gates with perfection. These can be 
the 7r/8-rotation and Hadamard gates, for example. This assumption may seem too idealistic, 
but represents the same kind of assumption we made to derive the classical digital protocol - 
that we can perform classical bit operations perfectly. 

Going back to the quantum case, each party then needs to apply a certain sequence of these 
gates in order to approximate the desired unitary to accuracy e. How many times does he need 
to apply the gates? 



CHAPTER 9. ENCODING INFORMATION IN A SINGLE QUBIT 



88 



The Solovay-Kitaev theorem gives a near-optimal bound answering this question [12C[. It 
was shown that to approximate a unitary with an error of at most e, we need to apply the gates 
from our discrete set only 0(log2(l/e)) times, where c is a constant approximately equal to two 
Q. As we have seen above, for a computation with error bounded by 5, we require e < S/(2 n+1 ). 
Using the Solovay-Kitaev result, we see that this requires each party to use 0(log2 2 n /S) gates, 
which is only polynomial in n. 

So we see that the exponential accuracy at each party can be reached with a number of 
gates which is polynomial in n. Since the number of parties is exponential in n, so is the total 
number of gates necessary for a quantum computation with high probability of success. We 
must remember, however, that this is also the case for the classical computation. 

To summarise what we found in the last two sections, we have seen that our task can 
be computed either with a digital classical memory or with a quantum memory. The same 
exponential number N = 2 n of function calls is necessary in both cases, and each function call 
requires a manipulation of the memory involving a polynomial number of gates. In the classical 
case the number of gates at each party is O(n), that necessary for adding two n-bit numbers. 
In the quantum case this number is 0(n c ), where c is approximately equal to two. Interestingly, 
with the same number of function calls a quantum computer performs the computation using 
only a single qubit memory, as opposed to the necessary 0(n)-bit classical memory. 

This is just one example of a trade-off between computation resources in different models 
of computation. In our case, we increase the number of gates used by a polynomial amount, 
while reducing the memory used also by a polynomial amount (from 0(n) to 1, i.e. constant). 
Studying trade-off compromises improves our understanding of the advantages provided by dif- 
ferent models of computation, such as analog or quantum. It is also important for experiments 
involving small-scale quantum computers, as it helps quantify the advantage one can get with 
l imit ed resources of different types. Some related work has been presented by Sypher et al. in 
[]178[| , where some simple computations using a quantum computer with one and two qubits were 
experimentally performed. 



9.5.4 Quantum versus analog computation 

In the sections above we have been comparing a perfect digital classical computation with a 
quantum computation using a qubit that succeeds with high probability 1 — 5. As we have 
noted already in section |9.2.2 , however, there is a simple way of solving the task with an analog 



classical memory (a rod). In this section I compare the quantum computation using a qubit 
with this classical analog computation. 

It is important to note that the classical analog computation model is not physically sound - 
at some point quantum phenomena and the atomic structure of any analog device will make the 
system deviate from our classical expectations. This is already a good argument for consideration 
of quantum systems instead, because they reflect the computational capacity of Nature to the 
best of our current knowledge. 

Unfortunately, there is relatively little reported research work on the complexity of analog 
computation. It has been noted that it is possible to make some analog models which solve 



factoring and the satisfiability problem in polynomial time [188]. In particular, there has even 
been a proposal for a prototype of a fast factoring device using light-emitting diodes (LED's) and 
photodetectors [172, 134, 173|. These models, however, in general have have limited scalability 



due to exponentially increasing resources of some kind. Defining which resources have to be 
considered in alternative analog models of computation is an open research problem. Some 
aspects of analog computation complexity were discussed by Vergis et al. in \194]. 



This notwithstanding, we can at least compare some characteristics of the ideal quantum and 
(analog) classical computation of our task. Let us start by presenting some of the advantages 
of the analog classical rod over our qubit-based computation. First, the rod is more powerful 
because it enables us to read out J (p(x)dx mod 2a, instead of the just the single bit necessary 
for our task. Moreover, the analog computation is robust against small imperfections in the 
rotations, as the rod still ends up close enough to the ideal final position for us to read out the 
correct answer. 



1 More precisely, a proof can be given for 2 < c < 2 + r/, with r\ arbitrarily small. See [151, Appendix 3]) 
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Figure 9.2: Instead of sending the qubit through a quantum channel, we can teleport it from 
party to party. Each party needs to transmit two bits to the next party Pj , who then implements 
a suitable unitary T to complete the teleportation. Then the unitary Uj is applied, depending 
on Pj's data. After that the party teleports the qubit to the next one, through a Bell-basis 
projection of the two qubit system consisting of the resulting qubit, plus the qubit entangled 
with the next party. 



The quantum computation with a qubit is not robust in that sense, as any imperfection 
in one unitary generally results in some probability of error at the end. This is not much of 
a problem if we accept a small probability of error 8 at the end. The demands for such a 
probabilistic quantum computation were evaluated in the last two section, in terms of accuracy 
and number of gates. Despite the fact that it is not clear how to weigh computational resources 
in analog computation, it is reasonable to claim that the quantum computation with a qubit is 
enormously more economical in space than the analog computation. Any analog system used as 
a memory for our task must be physically constituted by a great number of truly fundamental 
quantum systems, each of which can be compared with the qubit. 

A fairer comparison of the quantum and analog solutions to our task would demand further 
work to be done. First, we would need to establish a physically reasonable model for errors in the 
rod's rotations, and evaluate the computations' tolerance to them. Then we could compare that 
to the analysis we did in the previous two sections, for an error-bounded quantum computation. 



9.6 Simulating entanglement with classical communication 

Entanglement figures as a resource in a number of communication tasks. It provides space-like 
separated parties with a means of obtaining a string of bits which are correlated more strongly 
than is possible with classical, local physics. If the parties are time-like separated, however, 
the same strongly correlated string of bits can be obtained using some classical communication 
between the parties. By studying how much classical communication is necessary to simulate the 
presence of entanglement in this sense, we can get a better understanding of the communication 
advantage provided by entanglement. 

The task we have been studying in this chapter can be used to shed some light on the cost 
of simulating entanglement with classical communication. In order to see how, let us go back 
to the discrete version of the quantum protocol (with N parties), and imagine that the parties 
do not have a quantum channel to send the qubits through. Instead of offering the parties the 
advantage of quantum communication, we give them quantum entanglement: each neighbouring 
pair of parties starts with a maximally entangled pair of qubits. Now they can overcome the 
lack of a quantum channel and solve the task using (N — 1) teleportation steps (see figure |9.2| ). 
Each teleportation step requires only two bits of one-way communication, plus the previously 
shared entang lemen t. 

In section |9,3.2] we proved that no finite amount of one-way classical communication can 
perfectly solve the task. The procedure outlined above, however, succeeds with only two bits of 
one-way classical communication, plus entanglement. We are thus forced to conclude that no 
finite amount of one-way communication can perfectly simulate the communication advantage 
provided by entanglement. 
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This contrasts with previous results obtained by various authors, which show that entangle- 



ment simulation can be achieved with a finite amount of communication only [23, 47, 141 1- Next 



we will have a brief look at these results, and show how they can be reconciled with our proof. 
9.6.1 Some results on entanglement simulation 

Brassard et al. [ |23| j start by defining exactly which quantum measurements we are trying to 
simulate, and the means allowed. They define a quantum measurement scenario with a triple of 
the form (\^)ab , Ma., Mb), where \^)ab is the bipartite quantum state in consideration, and 
Ma and Mb are the sets of measurements at A and B whose results we will try to simulate. 

We are allowed to use any local hidden variable scheme, augmented by a certain amount of 
classical communication between the parties (which we will try to quantify). Their main result 
can be stated in a theorem: 



Theorem (Brassard et al. [ p3|[ ): For the quantum measurement scenario (\<&) AB , Ma, Mb), 
with 1^)^ = (| 00) + 1 11)) and Ma, Mb consisting of any von Neumann measurement 

on each qubit, there exists a local hidden variable scheme augmented with eight bits of 
communication (from Alice to Bob) that exactly simulates it. 



Another protocol related to entanglement simulation was presented in [gTJ , where Cerf , Gisin 
and Massar discussed what they called 'classical teleportation'. They considered the situation 
in which Alice needs to send information to Bob about a known state she has, in such a way 
that Bob can reproduce any measurement outcome on that state. This is related to Brassard et 
al.'s protocol, but there are some significant differences. Unlike them, Cerf et al. consider more 
general POVM's, and not only projective measurements. 

Their main result is that on average, only a finite amount of bi-directional communication is 
enough to do this 'classical teleportation'. Note, however, that our task demands only one-way 
communication, which means that their result need not apply in our case. 

In the next section we will discuss the relation between our task and the type of entanglement 
simulation discussed in the two papers briefly reviewed here. 

9.6.2 Simulating our task 

The results reviewed in the previous sections imply that a finite amount of communication is 
sufficient to simulate entanglement. In this section we will qualify this statement, explaining 
why it does not apply to the simulation of entanglement-based version of our task. 

The theorem by Brassard et al. implies that it is easy to simulate one teleportation step 
with classical communication. This may not be intuitive but, given their results, we must accept 
that some operations (involving a few real-valued parameters) on a single entangled pair can 
be perfectly simulated with classical communication. Despite the communication being of just 
a few bits, we must remember that the parties can previously share an unlimited amount of 
instructions, i.e. local hidden variables. 

The situation proposed in our task is different, as the outcomes of the measurement made 
at the last party actually depend on a (potentially unlimited) chain of real-valued processes 
affecting the quantum systems used, such as unitaries and POVM measurements. To see this, 
let us consider what each party needs to do (see figure |9.2]) . Each party has two qubits, one 
entangled with party Pk-i (qubit 1) and another entangled with party Pk+i (qubit 2). Upon 
receiving the two classical bits from Pk—i, party P^ applies the corresponding unitary T^-i on 
qubit 1 to complete the teleportation process. Then, P^ will do a special projective measurement 
on the joint system of qubits 1 and 2. This measurement consists of first applying a unitary 
to qubit 1, which is dependent on P^'s data; and then projecting the qubits on the Bell basis, 
broadcasting the result to party Pk+i- These two steps can be viewed as resulting in a POVM 
measurement on each of the qubits. Because the promise of the problem does not impose any 
restriction on each party's unitary, each such POVM measurement is effectively random. 

In order to perform the task with classical communication only, we would need to simulate 
perfectly the projective measurement performed on the qubit by the last party. However, this 
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requires information about all the POVM's performed by all the previous parties. The bound we 
obtained shows that this amount of information, if conveyed by classical means, is unbounded, 
increasing with the number of parties. 

It is interesting to view our entanglement-based task from a slightly different perspective. We 
can consider the (N— 1) entangled pairs of qubits as one long quantum system, whose constituent 
qubits we can measure at will. The particular sequence of measurements that implement our 
entanglement-based protocol tells us something interesting about information flow in quantum 
systems. We have proven that quantum mechanics (or any hidden- variable theory compatible 
with it) necessarily requires an unbounded amount of one-way information flow between the 
ends of this entangled chain. 

This result is related to the necessity of real-valued variables in any hidden-variable theory 
for quantum mechanics, which we discussed in section 9.4. 



9.7 Chapter summary 

The description of quantum systems and operations involve real numbers, and as such require 
an unbounded amount of classical information. However, we can only send one bit of classical 
communication per qubit of quantum communication. 

In this chapter we explored a simple communication task which shows that quantum theory, 
and indeed any hidden- variable theory consistent with it, necessarily requires real- valued vari- 
ables. What is more, I have shown how these real-valued parameters can be used to advantage in 
distributed computation applications involving either qubit communication or use of quantum 
entanglement. 

I have shown that in this application the quantum-based protocols outperform any classical 
protocols using only a finite amount of classical communication. In particular, I have shown 
that in order to account for the information flow required by measurements on entangled states, 
we necessarily need an unbounded amount of one-way classical communication. 

From a computational perspective, we have seen that the simple task we analysed clarifies 
possible computational resources trade-offs we can obtain when changing from classical to quan- 
tum computation. In particular, I have discussed how quantum computers can offer a memory 
size advantage, while requiring more computational steps for the same task. Consideration of 
this kind of resource trade-offs is important for appreciating and maximising the computational 
capabilities of quantum or analog computers. 



Chapter 10 

Conclusion 



In this thesis I have tried to establish connections between simple quantum information protocols 
and some foundational issues in quantum theory. In this last chapter I comment on my main 
results and suggest some open research problems related to them. 

In chapters |7|, |8] and ^ I examined a few simple applications and identified the fundamen- 
tal quantum traits behind their better-than-classical performance. A generally acknowledged 
quantum resource is quantum non-locality, which is indeed essential in some of the applications 
reviewed. In this thesis I have identified two other quantum resources giving better-than-classical 
performance for some information processing tasks: quantum contextuality, and the continuous 
nature of the set of pure quantum states. 

Regarding contextuality, it may have come as as surprise that in chapter |2| it was presented 
as a more fundamental quantum property, from which non-locality follows. There are two main 
reasons for presenting contextuality the way I did. 

The first reason is simplicity: I think it is conceptually easier to first understand the im- 
possibility of assigning (non-contextual) values to experimental outcomes, and then impose the 
locality condition and try to do the same. This way of putting things reveals non-locality as a 
manifestation of contextuality with a 'twist', i.e. contextuality of space-like separated measure- 
ments. 

This brings us to the second, and more important, reason for discussing contextuality. My 
preoccupation with experimental feasibility of quantum information applications led me to real- 
ize that in most circumstances the quantum measurements involved are not space-like separated. 
Strictly speaking, this rules out quantum non-locality as the characteristic responsible for their 
higher-than-classical p er for mance . 

Some researchers have been aware of that for a long time, and prefer to use more accurate 
terms such as 'quantum correlations' instead of non-locality. At the same time, from a more fun- 
damental perspective, there has been a drive towards understanding the nature of the 'loopholes' 
that remain open in quantum non-locality testing, such as inefficiencies in detection, imperfect 
state preparation and the lack of strict locality conditions for the measurements. 

In chapters [?] and |8| I revisited these ideas to stress the point that non-locality tests without 
strict locality conditions (i.e. without closing the locality 'loophole') are in fact equivalent to 
contextuality tests. Some relations between contextuality and non-locality have been pointed 



out by other authors |144|, 146, 177]. In this thesis I make the connection explicit, and show how 



contextuality can be used in experimental implementations of quantum information applications. 
It is not just the case that feasible implementations fail to impose locality conditions - many 
times these cannot be imposed even in principle. That is the case of the 2 — > 1 QRAC using 
qubit communication, in which Alice prepares a state and sends it to Bob. 

In my opinion quantum contextuality should be given more importance as a potential re- 
source in qua ntum information applications. This point has been implicitly addressed also by 
Vaidman [184], who argued for the importance of using what he calls 'quantum games' to perform 
information processing and communication tasks which are impossible classically. These appli- 
cations are of the type that I describe in this thesis, relying also on the stronger-than-classical 
correlations offered by quantum contextuality. 
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Contextuality and non-locality can be considered as resources to be understood and quan- 
tified for use in quantum information applications. This leads us to the subject I discussed in 
chapters ||] and ||, the characterisation of tripartite quantum entanglement. 

An interesting feature of multipartite entanglement is the fact that it appears in a number of 
qualitatively different types. Classifying these types is quite challenging, but may help identify 
states which may be potentially useful due to their unusual entanglement structure. Two exam- 
ples of such unusual states ar e th e four-party cluster states ( |5.3| ) used in an alternative model 
of quantu m com putation [24, 163], and the W states ([O]) use in non- locality and contextuality 
proofs [^, |219| , and in an application involving quantum games |95[| . 

In chapter [| I identified several good candidates for displaying a new type of tripartite 
entanglement under asymptotic transformations. At least one such family of states has since 
been rigorously shown to be inequivalent to the previously known tripartite entanglement classes 
||. It would be interesting to prove the same for the other states I singled out in chapter [|. 

More generally, one may ask whether there exists a finite Minimal Entanglement Generat- 
ing Set (MREGS) for tripartite entanglement. It may actually be the case that multipartite 
entanglement manipulations are genuinely irreversible, as happens with bound entanglement in 
the bipartite setting. Identifying the causes of this could be very interesting from a theoretical 
point of view. 

In section 5.5 I gave an argument showing that in at least one well-motivated sense, the 
W state of three qubits is more entangled than the GHZ state. In the near future I intend to 
investigate this question more closely, with the aim of checking whether it is possible to explicitly 
describe non-locality tests supporting this claim. 

Changing direction a bit, we may ask what fundamental laws govern the distribution of 
quantum information, and how to make optimal use of the manipulations that are allowed. The 
no-cloning theorem offered an initial negative result, which motivated further research into the 
allowed processes of quantum information distribution. In chapter |6| I investigated how quantum 
cloning could be useful in quantum computing. This resulted in the two examples of tasks for 
which cloning offers some advantages, using two different types of cloning protocols. 

An interesting research question on cloning consists in trying to find a more fundamental 
justification of the optimal achievable cloning fidelities. A step in that direction was provided 
by Gisin's work bounding the fidelities from the requirement that there should be no faster- 
than-light signalling |$7|. I suspect approaches deriving from quantum information may prove 
useful for this problem. For example, one can establish bounds on the cloning fidelities using 
the invariant information of Brukner and Zeilinger, in a similar way as I did with QRAC's in 



section 7.5. Another suggestive result is the curious fidelity balance I found in section 4.3.4 . 
Clearly, there is much still to be understood about quantum information distribution. 

Physical theories and theoretical applications have one thing in common: their justification 
comes only through successful experiments. Some quantum information applications may be 
experimentally feasible already, or may become so in the near future. In chapters |7] and ||| I 
demonstrated the feasibility of a couple of applications, and also argued that their implementa- 
tion would be equivalent to tests of properties such as non-locality and contextuality. 

Given these first results, it is clear that more feasibility studies are necessary at this point. 
There are a number of quantum information applications which seem promising and possibly in- 
teresting from a foundational aspect, among them quantum cryptographic protocols, absorption- 



reduced imaging [127, 115, 113, 142], and quantum games (see WM and references therein). 

In chapter |y I proposed an information processing task in which a single qubit is used to 
encode information which would require an unlimited classical digital memory to store. Surpris- 
ingly, this has bearings on different problems: space complexity theory, general hidden-variable 
theories, and on the classical communication cost of simulating entanglement. 

In this case, the advantage comes not from the exponentially large Hilbert space (as in Shor's 
algorithm) or from entanglement (as in teleportation), but from the continuous nature of the set 
of pure states available to quantum systems. This quantum characteristic is central in Hardy's 
axiomatic formulation of quantum theory, and had not been explicitly identified as an important 
quantum information resource up to now. 

This suggests that in some senses quantum computation is similar to classical analog compu- 
tation. An open research question consists in clarifying the similarities and differences between 
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these two models, with a view to understanding better the quantum advantage provided by en- 
tanglement, for example. As we have seen, it is also important to understand quantitatively the 
requirements on computational resources such as space, time and accuracy. Regarding the task 
studied in chapter ^, it would be interesting to quantify the trade-offs possible between these 
resources, in the three models of computation: classical digital, classical analog, and quantum. 

The general approach I took in this thesis suggests different motivations for experimental 
implementations of quantum information protocols. This comes from the realization that simple 
experiments may actually correspond to a number of different applications. For example, in 
chapter ^ I showed that one simple experiment can be interpreted alternatively as: a locality or 
contextuality test; or as protocol for remote state preparation; or as a quantum communication 
complexity protocol; or as a 2 — > 1 quantum random access code. Another example is the feasible 
one-qubit communication complexity protocol of chapter || which can also be interpreted as an 
experiment to demonstrate that a one-qubit quantum computer is better than a one-bit classical 
computer for some computations. This follows from the computational interpretation of the 
generalization of that task developed in chapter ^. 

The applications I analysed help clarify some of the requirements for experiments to establish 
some fundamental quantum characteristics. For example, the multi-party entanglement-based 
communication complexity protocol of chapter || automatically gives us sufficient conditions for 
establishing multipartite non-locality or contextuality. In the same chapter I also analysed the 
experimental requirements for a demonstration that quantum communication can be better than 
classical communication for some tasks. 

Of course, there are many other approaches one may use to gain insights on the foundations 
of quantum theory. Most attempts to unify quantum theory and gravitation, for example, 
seem to acknowledge the importance of recovering the information-theoretic description of black 
hole entropy obtained by Beckenstein [11]. O ther authors h ave uncovered interesting analogies 



between entanglement and thermodynamics [ 16C , 191 , 13E [. Also, there are results suggesting 



that quantum phenomena may bring surprising consequence s to thermodynamics, for example 
by improving the efficiencies of some types of heat engines [171]. On t he ex perimental front, 
recent experiments revealing entanglement between macroscopic objects | 118 | suggest that the 
robustness of large-scale entanglement may be investigated experimentally. Needless to say, 
these techniques may also be useful for practical quantum information applications. 

These different approaches seem to be complementary, pointing towards an understanding of 
quantum theory which is being shaped around the idea of information as a foundational concept. 
From a more practical point of view, quantum information applications are finally bringing 
quantum theory's most counter-intuitive characteristics to within reach of experiments. These 
developments engage the interest of the general public, and provide challenges to an expanding, 
interdisciplinary research community. It is a privilege to be doing research at a time when 
exciting quantum theoretical concepts are finally maturing into useful applications. 
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